首页 > 最新文献

2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)最新文献

英文 中文
Indirection systems for shingled-recording disk drives 带状记录磁盘驱动器的间接系统
Pub Date : 2010-05-03 DOI: 10.1109/MSST.2010.5496971
Yuval Cassuto, M. Sanvido, Cyril Guyot, David R. Hall, Z. Bandic
Shingled magnetic recording is a promising technology to increase the capacity of hard-disk drives with no significant cost impact. Its main drawback is that random-write access to the disk is restricted due to overlap in the layout of data tracks. For computing and storage systems to enjoy the increased capacity, it is necessary to mitigate these access restrictions, and present a storage device that serves unrestricted read/write requests with adequate performance. This paper proposes two different indirection systems to mask access restrictions and optimize performance. The first one is a diskcache based architecture that provides unrestricted access with manageable drop in performance. A second, more complex indirection system, utilizes a new storage unit called S-block. It is shown that the S-block architecture allows good sustained random-write performance, a point where the disk-cache architecture fails. The organization and algorithms of both architectures are specified in detail. Each was implemented and simulated as a discrete-event simulation, mimicking its operation on real storage devices. For the performance evaluation both synthetic workloads and traces from real workloads were used.
瓦式磁记录是一种很有前途的技术,可以在不影响成本的情况下增加硬盘驱动器的容量。它的主要缺点是,由于数据轨道布局的重叠,对磁盘的随机写访问受到限制。为了使计算和存储系统享受到增加的容量,有必要减轻这些访问限制,并提供具有足够性能的不受限制的读/写请求的存储设备。本文提出了两种不同的间接系统来掩盖访问限制和优化性能。第一种是基于磁盘缓存的体系结构,它提供不受限制的访问,但性能下降是可管理的。第二种更复杂的间接系统,利用一种叫做S-block的新存储单元。结果表明,s块体系结构允许良好的持续随机写入性能,这是磁盘缓存体系结构失败的地方。详细介绍了两种体系结构的组织和算法。每个都作为离散事件模拟实现和模拟,模拟其在真实存储设备上的操作。对于性能评估,我们使用了合成工作负载和来自真实工作负载的跟踪。
{"title":"Indirection systems for shingled-recording disk drives","authors":"Yuval Cassuto, M. Sanvido, Cyril Guyot, David R. Hall, Z. Bandic","doi":"10.1109/MSST.2010.5496971","DOIUrl":"https://doi.org/10.1109/MSST.2010.5496971","url":null,"abstract":"Shingled magnetic recording is a promising technology to increase the capacity of hard-disk drives with no significant cost impact. Its main drawback is that random-write access to the disk is restricted due to overlap in the layout of data tracks. For computing and storage systems to enjoy the increased capacity, it is necessary to mitigate these access restrictions, and present a storage device that serves unrestricted read/write requests with adequate performance. This paper proposes two different indirection systems to mask access restrictions and optimize performance. The first one is a diskcache based architecture that provides unrestricted access with manageable drop in performance. A second, more complex indirection system, utilizes a new storage unit called S-block. It is shown that the S-block architecture allows good sustained random-write performance, a point where the disk-cache architecture fails. The organization and algorithms of both architectures are specified in detail. Each was implemented and simulated as a discrete-event simulation, mimicking its operation on real storage devices. For the performance evaluation both synthetic workloads and traces from real workloads were used.","PeriodicalId":350968,"journal":{"name":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121580176","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 99
Security Aware Partitioning for efficient file system search 安全意识分区,高效的文件系统搜索
Pub Date : 2010-05-03 DOI: 10.1109/MSST.2010.5496990
Aleatha Parker-Wood, Christina E. Strong, E. L. Miller, D. Long
Index partitioning techniques-where indexes are broken into multiple distinct sub-indexes-are a proven way to improve metadata search speeds and scalability for large file systems, permitting early triage of the file system. A partitioned metadata index can rule out irrelevant files and quickly focus on files that are more likely to match the search criteria. Also, in a large file system that contains many users, a user's search should not include confidential files the user doesn't have permission to view. To meet these two parallel goals, we propose a new partitioning algorithm, Security Aware Partitioning, that integrates security with the partitioning method to enable efficient and secure file system search. In order to evaluate our claim of improved efficiency, we compare the results of Security Aware Partitioning to six other partitioning methods, including implementations of the metadata partitioning algorithms of SmartStore and Spyglass, two recent systems doing partitioned search in similar environments. We propose a general set of criteria for comparing partitioning algorithms, and use them to evaluate the partitioning algorithms. Our results show that Security Aware Partitioning can provide excellent search performance at a low computational cost to build indexes, O(n). Based on metrics such as information gain, we also conclude that expensive clustering algorithms do not offer enough benefit to make them worth the additional cost in time and memory.
索引分区技术(将索引分解为多个不同的子索引)是一种经过验证的方法,可以提高大型文件系统的元数据搜索速度和可伸缩性,允许对文件系统进行早期分类。分区元数据索引可以排除不相关的文件,并快速关注更有可能匹配搜索条件的文件。此外,在包含许多用户的大型文件系统中,用户的搜索不应该包括用户没有权限查看的机密文件。为了满足这两个并行的目标,我们提出了一种新的分区算法,即安全感知分区,它将安全性与分区方法相结合,以实现高效和安全的文件系统搜索。为了评估我们所声称的提高效率,我们将安全感知分区的结果与其他六种分区方法进行了比较,包括SmartStore和Spyglass的元数据分区算法的实现,这是两个在类似环境中进行分区搜索的最新系统。我们提出了一套比较划分算法的通用准则,并用它们来评价划分算法。我们的结果表明,安全感知分区能够以较低的构建索引的计算成本(O(n))提供出色的搜索性能。基于诸如信息增益之类的度量,我们还得出结论,昂贵的聚类算法不能提供足够的好处,因此不值得在时间和内存上花费额外的成本。
{"title":"Security Aware Partitioning for efficient file system search","authors":"Aleatha Parker-Wood, Christina E. Strong, E. L. Miller, D. Long","doi":"10.1109/MSST.2010.5496990","DOIUrl":"https://doi.org/10.1109/MSST.2010.5496990","url":null,"abstract":"Index partitioning techniques-where indexes are broken into multiple distinct sub-indexes-are a proven way to improve metadata search speeds and scalability for large file systems, permitting early triage of the file system. A partitioned metadata index can rule out irrelevant files and quickly focus on files that are more likely to match the search criteria. Also, in a large file system that contains many users, a user's search should not include confidential files the user doesn't have permission to view. To meet these two parallel goals, we propose a new partitioning algorithm, Security Aware Partitioning, that integrates security with the partitioning method to enable efficient and secure file system search. In order to evaluate our claim of improved efficiency, we compare the results of Security Aware Partitioning to six other partitioning methods, including implementations of the metadata partitioning algorithms of SmartStore and Spyglass, two recent systems doing partitioned search in similar environments. We propose a general set of criteria for comparing partitioning algorithms, and use them to evaluate the partitioning algorithms. Our results show that Security Aware Partitioning can provide excellent search performance at a low computational cost to build indexes, O(n). Based on metrics such as information gain, we also conclude that expensive clustering algorithms do not offer enough benefit to make them worth the additional cost in time and memory.","PeriodicalId":350968,"journal":{"name":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115675821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
dedupv1: Improving deduplication throughput using solid state drives (SSD) dedupv1:使用SSD (solid state drives)提高重复数据删除吞吐量
Pub Date : 2010-05-03 DOI: 10.1109/MSST.2010.5496992
Dirk Meister, A. Brinkmann
Data deduplication systems discover and remove redundancies between data blocks. The search for redundant data blocks is often based on hashing the content of a block and comparing the resulting hash value with already stored entries inside an index. The limited random IO performance of hard disks limits the overall throughput of such systems, if the index does not fit into main memory. This paper presents the architecture of the dedupv1 dedupli-cation system that uses solid-state drives (SSDs) to improve its throughput compared to disk-based systems. dedupv1 is designed to use the sweet spots of SSD technology (random reads and sequential operations), while avoiding random writes inside the data path. This is achieved by using a hybrid deduplication design. It is an inline deduplication system as it performs chunking and fingerprinting online and only stores new data, but it is able to delay much of the processing as well as IO operations.
重复数据删除系统发现并消除数据块之间的冗余。对冗余数据块的搜索通常基于对块的内容进行散列,并将结果散列值与索引中已存储的条目进行比较。如果索引不适合主存,那么硬盘有限的随机IO性能会限制此类系统的总体吞吐量。本文介绍了使用固态驱动器(ssd)的dedupv1重复数据删除系统的体系结构,与基于磁盘的系统相比,该系统可以提高吞吐量。dedupv1旨在使用SSD技术的最佳点(随机读取和顺序操作),同时避免数据路径内的随机写入。这是通过使用混合重复数据删除设计实现的。它是一个内联重复数据删除系统,因为它在线执行分块和指纹识别,并且只存储新数据,但是它能够延迟大部分处理和IO操作。
{"title":"dedupv1: Improving deduplication throughput using solid state drives (SSD)","authors":"Dirk Meister, A. Brinkmann","doi":"10.1109/MSST.2010.5496992","DOIUrl":"https://doi.org/10.1109/MSST.2010.5496992","url":null,"abstract":"Data deduplication systems discover and remove redundancies between data blocks. The search for redundant data blocks is often based on hashing the content of a block and comparing the resulting hash value with already stored entries inside an index. The limited random IO performance of hard disks limits the overall throughput of such systems, if the index does not fit into main memory. This paper presents the architecture of the dedupv1 dedupli-cation system that uses solid-state drives (SSDs) to improve its throughput compared to disk-based systems. dedupv1 is designed to use the sweet spots of SSD technology (random reads and sequential operations), while avoiding random writes inside the data path. This is achieved by using a hybrid deduplication design. It is an inline deduplication system as it performs chunking and fingerprinting online and only stores new data, but it is able to delay much of the processing as well as IO operations.","PeriodicalId":350968,"journal":{"name":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121748719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 92
High performance solid state storage under Linux Linux下的高性能固态存储
Pub Date : 2010-05-03 DOI: 10.1109/MSST.2010.5496976
Eric Seppanen, M. O'Keefe, D. Lilja
Solid state drives (SSDs) allow single-drive performance that is far greater than disks can produce. Their low latency and potential for parallel operations mean that they are able to read and write data at speeds that strain operating system I/O interfaces. Additionally, their performance characteristics expose gaps in existing benchmarking methodologies. We discuss the impact on Linux system design of a prototype PCI Express SSD that operates at least an order of magnitude faster than most drives available today. We develop benchmarking strategies and focus on several areas where current Linux systems need improvement, and suggest methods of taking full advantage of such high-performance solid state storage. We demonstrate that an SSD can perform with high throughput, high operation rates, and low latency under the most difficult conditions. This suggests that high-performance SSDs can dramatically improve parallel I/O performance for future high performance computing (HPC) systems.
固态硬盘(ssd)允许单驱动器的性能远远超过磁盘所能提供的性能。它们的低延迟和潜在的并行操作意味着它们能够以使操作系统I/O接口紧张的速度读取和写入数据。此外,它们的性能特征暴露了现有基准测试方法中的差距。我们讨论了一个原型PCI Express SSD对Linux系统设计的影响,它的运行速度至少比目前可用的大多数驱动器快一个数量级。我们开发了基准测试策略,重点关注当前Linux系统需要改进的几个领域,并提出了充分利用这种高性能固态存储的方法。我们证明了SSD可以在最困难的条件下以高吞吐量、高运行率和低延迟执行。这表明高性能ssd可以显著提高未来高性能计算(HPC)系统的并行I/O性能。
{"title":"High performance solid state storage under Linux","authors":"Eric Seppanen, M. O'Keefe, D. Lilja","doi":"10.1109/MSST.2010.5496976","DOIUrl":"https://doi.org/10.1109/MSST.2010.5496976","url":null,"abstract":"Solid state drives (SSDs) allow single-drive performance that is far greater than disks can produce. Their low latency and potential for parallel operations mean that they are able to read and write data at speeds that strain operating system I/O interfaces. Additionally, their performance characteristics expose gaps in existing benchmarking methodologies. We discuss the impact on Linux system design of a prototype PCI Express SSD that operates at least an order of magnitude faster than most drives available today. We develop benchmarking strategies and focus on several areas where current Linux systems need improvement, and suggest methods of taking full advantage of such high-performance solid state storage. We demonstrate that an SSD can perform with high throughput, high operation rates, and low latency under the most difficult conditions. This suggests that high-performance SSDs can dramatically improve parallel I/O performance for future high performance computing (HPC) systems.","PeriodicalId":350968,"journal":{"name":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125643030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 61
Scalable storage support for data stream processing 数据流处理的可扩展存储支持
Pub Date : 2010-05-03 DOI: 10.1109/MSST.2010.5496977
Zoe Sebepou, K. Magoutis
Continuous data stream processing systems have offered limited support for data persistence in the past, for three main reasons: First, online, real-time queries examine current streaming data and (under the assumption of no server failures) do not require access to past data; second, stable storage devices are commonly thought to be constraining system throughput and response times when compared to main memory, and are thus kept off the common path; finally, the use of scalable storage solutions which would be required to sustain high data streaming rates have not been thoroughly investigated in the past. Our work advances the state of the art by providing data streaming systems with a scalable path to persistent storage. This path has low impact in the performance properties of a scalable streaming system and allows two fundamental enhancements to their capabilities: First, it allows stream persistence for reference/archival purposes (in other words, queries can now be applied on past data on-demand); second, fault tolerance is achievable by checkpointing and stream replay schemes that are not constrained by the size of main memory.
过去,连续数据流处理系统对数据持久性提供的支持有限,主要有三个原因:首先,在线、实时查询检查当前流数据,并且(假设没有服务器故障)不需要访问过去的数据;其次,与主存相比,稳定的存储设备通常被认为限制了系统吞吐量和响应时间,因此被排除在公共路径之外;最后,使用可扩展的存储解决方案来维持高数据流速率在过去并没有得到彻底的研究。我们的工作通过为数据流系统提供通向持久存储的可扩展路径,推动了技术的发展。这条路径对可扩展流系统的性能影响很小,并允许对其功能进行两个基本的增强:首先,它允许用于参考/存档目的的流持久化(换句话说,现在可以按需对过去的数据应用查询);其次,容错可以通过不受主存大小限制的检查点和流重放方案来实现。
{"title":"Scalable storage support for data stream processing","authors":"Zoe Sebepou, K. Magoutis","doi":"10.1109/MSST.2010.5496977","DOIUrl":"https://doi.org/10.1109/MSST.2010.5496977","url":null,"abstract":"Continuous data stream processing systems have offered limited support for data persistence in the past, for three main reasons: First, online, real-time queries examine current streaming data and (under the assumption of no server failures) do not require access to past data; second, stable storage devices are commonly thought to be constraining system throughput and response times when compared to main memory, and are thus kept off the common path; finally, the use of scalable storage solutions which would be required to sustain high data streaming rates have not been thoroughly investigated in the past. Our work advances the state of the art by providing data streaming systems with a scalable path to persistent storage. This path has low impact in the performance properties of a scalable streaming system and allows two fundamental enhancements to their capabilities: First, it allows stream persistence for reference/archival purposes (in other words, queries can now be applied on past data on-demand); second, fault tolerance is achievable by checkpointing and stream replay schemes that are not constrained by the size of main memory.","PeriodicalId":350968,"journal":{"name":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116308900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Flat XOR-based erasure codes in storage systems: Constructions, efficient recovery, and tradeoffs
Pub Date : 2010-05-03 DOI: 10.1109/MSST.2010.5496983
K. Greenan, Xiaozhou Li, J. Wylie
Large scale storage systems require multi-disk fault tolerant erasure codes. Replication and RAID extensions that protect against two- and three-disk failures offer a stark tradeoff between how much data must be stored, and how much data must be read to recover a failed disk. Flat XOR-codes-erasure codes in which parity disks are calculated as the XOR of some subset of data disks-offer a tradeoff between these extremes. In this paper, we describe constructions of two novel flat XOR-code, Stepped Combination and HD-Combination codes. We describe an algorithm for flat XOR-codes that enumerates recovery equations, i.e., sets of disks that can recover a failed disk. We also describe two algorithms for flat XOR-codes that generate recovery schedules, i.e., sets of recovery equations that can be used in concert to achieve efficient recovery. Finally, we analyze the key storage properties of many flat XOR-codes and of MDS codes such as replication and RAID 6 to show the cost-benefit tradeoff gap that flat XOR-codes can fill.
大规模存储系统需要多盘容错纠删码。防止两盘和三盘故障的复制和RAID扩展在必须存储多少数据和必须读取多少数据以恢复故障磁盘之间提供了一个明显的权衡。扁平异或码——将奇偶校验磁盘计算为某些数据磁盘子集的异或的擦除码——提供了这两种极端之间的折衷。本文描述了两种新颖的平面异或码的构造:阶梯组合码和hd组合码。我们描述了一个平面异或码的算法,该算法列举了恢复方程,即可以恢复故障磁盘的磁盘集。我们还描述了用于生成恢复调度的平面异或码的两种算法,即可以协同使用以实现有效恢复的恢复方程集。最后,我们分析了许多平面异或码和MDS码(如复制和RAID 6)的关键存储特性,以显示平面异或码可以填补的成本-收益权衡差距。
{"title":"Flat XOR-based erasure codes in storage systems: Constructions, efficient recovery, and tradeoffs","authors":"K. Greenan, Xiaozhou Li, J. Wylie","doi":"10.1109/MSST.2010.5496983","DOIUrl":"https://doi.org/10.1109/MSST.2010.5496983","url":null,"abstract":"Large scale storage systems require multi-disk fault tolerant erasure codes. Replication and RAID extensions that protect against two- and three-disk failures offer a stark tradeoff between how much data must be stored, and how much data must be read to recover a failed disk. Flat XOR-codes-erasure codes in which parity disks are calculated as the XOR of some subset of data disks-offer a tradeoff between these extremes. In this paper, we describe constructions of two novel flat XOR-code, Stepped Combination and HD-Combination codes. We describe an algorithm for flat XOR-codes that enumerates recovery equations, i.e., sets of disks that can recover a failed disk. We also describe two algorithms for flat XOR-codes that generate recovery schedules, i.e., sets of recovery equations that can be used in concert to achieve efficient recovery. Finally, we analyze the key storage properties of many flat XOR-codes and of MDS codes such as replication and RAID 6 to show the cost-benefit tradeoff gap that flat XOR-codes can fill.","PeriodicalId":350968,"journal":{"name":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"48 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129307364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 96
The Linear Tape File System 线性磁带文件系统
Pub Date : 2010-05-03 DOI: 10.1109/MSST.2010.5496989
David A. Pease, A. Amir, L. V. Real, B. Biskeborn, Michael Richmond, Atsushi Abe
While there are many financial and practical reasons to prefer tape storage over disk for various applications, the difficultly of using tape in a general way is a major inhibitor to its wider usage. We present a file system that takes advantage of a new generation of tape hardware to provide efficient access to tape using standard, familiar system tools and interfaces. The Linear Tape File System (LTFS) makes using tape as easy, flexible, portable, and intuitive as using other removable and sharable media, such as a USB drive.
虽然在各种应用中,有许多经济和实际的原因使人们更喜欢磁带存储而不是磁盘,但在一般情况下使用磁带的困难是阻碍其广泛使用的主要因素。我们提出了一个利用新一代磁带硬件的文件系统,使用标准的、熟悉的系统工具和接口提供对磁带的有效访问。线性磁带文件系统(LTFS)使得使用磁带像使用其他可移动和可共享媒体(如USB驱动器)一样简单、灵活、便携和直观。
{"title":"The Linear Tape File System","authors":"David A. Pease, A. Amir, L. V. Real, B. Biskeborn, Michael Richmond, Atsushi Abe","doi":"10.1109/MSST.2010.5496989","DOIUrl":"https://doi.org/10.1109/MSST.2010.5496989","url":null,"abstract":"While there are many financial and practical reasons to prefer tape storage over disk for various applications, the difficultly of using tape in a general way is a major inhibitor to its wider usage. We present a file system that takes advantage of a new generation of tape hardware to provide efficient access to tape using standard, familiar system tools and interfaces. The Linear Tape File System (LTFS) makes using tape as easy, flexible, portable, and intuitive as using other removable and sharable media, such as a USB drive.","PeriodicalId":350968,"journal":{"name":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128633272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
Achieving page-mapping FTL performance at block-mapping FTL cost by hiding address translation 通过隐藏地址转换,以块映射FTL为代价实现页映射FTL性能
Pub Date : 2010-05-03 DOI: 10.1109/MSST.2010.5496970
Yang Hu, Hong Jiang, D. Feng, Lei Tian, Shu Ping Zhang, Jingning Liu, Wei Tong, Yi Qin, Liuzheng Wang
Flash Translation Layer (FTL) is one of the most important components of SSD, whose main purpose is to perform logical to physical address translation in a way that is suitable to the unique physical characteristics of the Flash memory technology. The pure page-mapping FTL scheme, arguably the best FTL scheme due to its ability to map any logical page number (LPN) to any physical page number (PPN) to minimize erase operations, cannot be practically deployed since it consumes a prohibitively large RAM (SRAM or DRAM) space to store the page-mapping table for an SSD of moderate to large size. Alternatives to the pure page-mapping FTL, such as block-mapping FTLs, hybrid FTLs (e.g., FAST) and the latest demand-based page-mapping FTLs (e.g., DFTL), require significantly less RAM space but suffer from a few performance issues. Block-mapping FTLs perform poorly with higher erasure counts, particularly under random write workloads. Hybrid FTL schemes incur costly merge operations that hurt performance and increase the erasure counts. Performances of demand-based FTLs heavily depend on workload characteristics such as access locality, read/write ratio and request arrival interval time. This paper proposes a new FTL scheme, called HAT, to achieve the performance of a pure page-mapping FTL at the RAM cost of a block-mapping FTL while consuming lower energy, by hiding the address translation (HAT). The basic idea behind our scheme is to create a separate access path to read/write the address mapping information to significantly Hide the Address-Translation latency by incorporating a low energy-consuming solid-state memory device that stores the entire page mapping table. We implement an SSD simulator, SSDsim, to validate our HAT design and evaluate its performance. The extensive trace-driven simulation results show that the performance of HAT is within 0.8% of the pure page-mapping FTL, while consuming about 50% of the energy.
Flash Translation Layer (FTL)是SSD最重要的组成部分之一,其主要作用是根据闪存技术独特的物理特性,完成逻辑地址到物理地址的转换。纯粹的页面映射FTL方案,可以说是最好的FTL方案,因为它能够将任何逻辑页码(LPN)映射到任何物理页码(PPN),以最小化擦除操作,不能实际部署,因为它消耗了非常大的RAM (SRAM或DRAM)空间来存储中等到大尺寸的SSD的页面映射表。纯页面映射FTL的替代方案,如块映射FTL、混合FTL(例如FAST)和最新的基于需求的页面映射FTL(例如DFTL),需要的RAM空间显著减少,但存在一些性能问题。块映射ftl在更高的擦除计数下表现不佳,特别是在随机写工作负载下。混合FTL方案会导致代价高昂的合并操作,这会损害性能并增加擦除计数。基于需求的ftl的性能在很大程度上取决于工作负载特征,如访问位置、读/写比率和请求到达间隔时间。本文提出了一种新的FTL方案,称为HAT,通过隐藏地址转换(HAT),以块映射FTL的RAM成本实现纯页面映射FTL的性能,同时消耗更低的能量。我们的方案背后的基本思想是创建一个单独的访问路径来读/写地址映射信息,通过结合存储整个页面映射表的低能耗固态内存设备来显着隐藏地址转换延迟。我们实现了一个SSD模拟器SSDsim来验证我们的HAT设计并评估其性能。广泛的跟踪驱动仿真结果表明,HAT的性能在纯页面映射FTL的0.8%以内,而能耗约为50%。
{"title":"Achieving page-mapping FTL performance at block-mapping FTL cost by hiding address translation","authors":"Yang Hu, Hong Jiang, D. Feng, Lei Tian, Shu Ping Zhang, Jingning Liu, Wei Tong, Yi Qin, Liuzheng Wang","doi":"10.1109/MSST.2010.5496970","DOIUrl":"https://doi.org/10.1109/MSST.2010.5496970","url":null,"abstract":"Flash Translation Layer (FTL) is one of the most important components of SSD, whose main purpose is to perform logical to physical address translation in a way that is suitable to the unique physical characteristics of the Flash memory technology. The pure page-mapping FTL scheme, arguably the best FTL scheme due to its ability to map any logical page number (LPN) to any physical page number (PPN) to minimize erase operations, cannot be practically deployed since it consumes a prohibitively large RAM (SRAM or DRAM) space to store the page-mapping table for an SSD of moderate to large size. Alternatives to the pure page-mapping FTL, such as block-mapping FTLs, hybrid FTLs (e.g., FAST) and the latest demand-based page-mapping FTLs (e.g., DFTL), require significantly less RAM space but suffer from a few performance issues. Block-mapping FTLs perform poorly with higher erasure counts, particularly under random write workloads. Hybrid FTL schemes incur costly merge operations that hurt performance and increase the erasure counts. Performances of demand-based FTLs heavily depend on workload characteristics such as access locality, read/write ratio and request arrival interval time. This paper proposes a new FTL scheme, called HAT, to achieve the performance of a pure page-mapping FTL at the RAM cost of a block-mapping FTL while consuming lower energy, by hiding the address translation (HAT). The basic idea behind our scheme is to create a separate access path to read/write the address mapping information to significantly Hide the Address-Translation latency by incorporating a low energy-consuming solid-state memory device that stores the entire page mapping table. We implement an SSD simulator, SSDsim, to validate our HAT design and evaluate its performance. The extensive trace-driven simulation results show that the performance of HAT is within 0.8% of the pure page-mapping FTL, while consuming about 50% of the energy.","PeriodicalId":350968,"journal":{"name":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115320770","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 57
Automated lookahead data migration in SSD-enabled multi-tiered storage systems 支持ssd的多层存储系统中的自动前瞻数据迁移
Pub Date : 2010-05-03 DOI: 10.1109/MSST.2010.5496999
Gong Zhang, Lawrence Chiu, Clem Dickey, Ling Liu, P. Muench, S. Seshadri
The significant IO improvements of Solid State Disks (SSD) over traditional rotational hard disks makes it an attractive approach to integrate SSDs in tiered storage systems for performance enhancement. However, to integrate SSD into multi-tiered storage system effectively, automated data migration between SSD and HDD plays a critical role. In many real world application scenarios like banking and supermarket environments, workload and IO profile present interesting characteristics and also bear the constraint of workload deadline. How to fully release the power of data migration while guaranteeing the migration deadline is critical to maximizing the performance of SSD-enabled multi-tiered storage system. In this paper, we present an automated, deadline-aware, lookahead migration scheme to address the data migration challenge. We analyze the factors that may impact on the performance of lookahead migration efficiency and develop a greedy algorithm to adaptively determine the optimal lookahead window size to optimize the effectiveness of lookahead migration, aiming at improving overall system performance and resource utilization while meeting workload deadlines. We compare our lookahead migration approach with the basic migration model and validate the effectiveness and efficiency of our adaptive lookahead migration approach through a trace driven experimental study.
与传统的旋转硬盘相比,固态硬盘(SSD)在IO方面的显著改进使得将固态硬盘集成到分级存储系统中以提高性能成为一种有吸引力的方法。然而,为了将SSD有效地集成到多层存储系统中,数据在SSD和HDD之间的自动迁移至关重要。在许多实际应用场景中,如银行和超市环境中,工作负载和IO配置文件呈现出有趣的特征,同时也受到工作负载截止日期的约束。如何在保证迁移期限的前提下充分释放数据迁移的力量,是实现ssd多层存储系统性能最大化的关键。在本文中,我们提出了一个自动化的、截止日期感知的、前瞻性的迁移方案来解决数据迁移的挑战。分析了影响前瞻迁移效率性能的因素,提出了一种自适应确定最优前瞻窗口大小的贪心算法,以优化前瞻迁移的有效性,在满足工作负载期限的前提下提高系统整体性能和资源利用率。将前瞻迁移方法与基本迁移模型进行了比较,并通过跟踪驱动的实验研究验证了自适应前瞻迁移方法的有效性和高效性。
{"title":"Automated lookahead data migration in SSD-enabled multi-tiered storage systems","authors":"Gong Zhang, Lawrence Chiu, Clem Dickey, Ling Liu, P. Muench, S. Seshadri","doi":"10.1109/MSST.2010.5496999","DOIUrl":"https://doi.org/10.1109/MSST.2010.5496999","url":null,"abstract":"The significant IO improvements of Solid State Disks (SSD) over traditional rotational hard disks makes it an attractive approach to integrate SSDs in tiered storage systems for performance enhancement. However, to integrate SSD into multi-tiered storage system effectively, automated data migration between SSD and HDD plays a critical role. In many real world application scenarios like banking and supermarket environments, workload and IO profile present interesting characteristics and also bear the constraint of workload deadline. How to fully release the power of data migration while guaranteeing the migration deadline is critical to maximizing the performance of SSD-enabled multi-tiered storage system. In this paper, we present an automated, deadline-aware, lookahead migration scheme to address the data migration challenge. We analyze the factors that may impact on the performance of lookahead migration efficiency and develop a greedy algorithm to adaptively determine the optimal lookahead window size to optimize the effectiveness of lookahead migration, aiming at improving overall system performance and resource utilization while meeting workload deadlines. We compare our lookahead migration approach with the basic migration model and validate the effectiveness and efficiency of our adaptive lookahead migration approach through a trace driven experimental study.","PeriodicalId":350968,"journal":{"name":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114024570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
S2-RAID: A new RAID architecture for fast data recovery S2-RAID:一种新的RAID架构,实现快速数据恢复
Pub Date : 2010-05-01 DOI: 10.1109/MSST.2010.5496980
Ji-guang Wan, Jibin Wang, Qing Yang, C. Xie
As disk volume grows rapidly with terabyte disk becoming a norm, RAID reconstruction time in case of a failure takes prohibitively long time. This paper presents a new RAID architecture, S2-RAID, allowing the disk array to reconstruct very quickly in case of a disk failure. The idea is to form skewed sub RAIDs (S2-RAID) in the RAID structure so that reconstruction can be done in parallel dramatically speeding up data reconstruction time and hence minimizing the chance of data loss. To make such parallel reconstruction conflict-free, each sub-RAID is formed by selecting one logic partition from each disk group with size being a prime number. We have implemented a prototype S2-RAID system in Linux operating system for the purpose of evaluating its performance potential. SPC IO traces and standard benchmarks have been used to measure the performance of S2-RAID as compared to existing baseline software RAID, MD. Experimental results show that our new S2-RAID speeds up data reconstruction time by a factor of 3 to 6 compared to the traditional RAID. At the same time, S2-RAID shows similar or better production performance than baseline RAID while online RAID reconstruction is in progress.
随着硬盘容量的快速增长,tb级硬盘已成为一种标准配置,在出现故障时重构RAID的时间非常长。本文提出了一种新的RAID结构,S2-RAID,允许磁盘阵列在磁盘故障的情况下非常快速地重建。其思想是在RAID结构中形成倾斜的子RAID (S2-RAID),以便可以并行地进行重构,从而大大加快数据重构时间,从而最大限度地减少数据丢失的机会。为了使这种并行重构无冲突,通过从每个磁盘组中选择一个大小为素数的逻辑分区来组成每个子raid。为了评估其性能潜力,我们在Linux操作系统上实现了一个S2-RAID系统的原型。与现有的基准软件RAID相比,SPC IO跟踪和标准基准测试已用于测量S2-RAID的性能。实验结果表明,与传统RAID相比,我们的新S2-RAID将数据重建时间加快了3到6倍。同时,在RAID在线重构过程中,S2-RAID的生产性能与基准RAID相当或更好。
{"title":"S2-RAID: A new RAID architecture for fast data recovery","authors":"Ji-guang Wan, Jibin Wang, Qing Yang, C. Xie","doi":"10.1109/MSST.2010.5496980","DOIUrl":"https://doi.org/10.1109/MSST.2010.5496980","url":null,"abstract":"As disk volume grows rapidly with terabyte disk becoming a norm, RAID reconstruction time in case of a failure takes prohibitively long time. This paper presents a new RAID architecture, S2-RAID, allowing the disk array to reconstruct very quickly in case of a disk failure. The idea is to form skewed sub RAIDs (S2-RAID) in the RAID structure so that reconstruction can be done in parallel dramatically speeding up data reconstruction time and hence minimizing the chance of data loss. To make such parallel reconstruction conflict-free, each sub-RAID is formed by selecting one logic partition from each disk group with size being a prime number. We have implemented a prototype S2-RAID system in Linux operating system for the purpose of evaluating its performance potential. SPC IO traces and standard benchmarks have been used to measure the performance of S2-RAID as compared to existing baseline software RAID, MD. Experimental results show that our new S2-RAID speeds up data reconstruction time by a factor of 3 to 6 compared to the traditional RAID. At the same time, S2-RAID shows similar or better production performance than baseline RAID while online RAID reconstruction is in progress.","PeriodicalId":350968,"journal":{"name":"2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"184 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115502554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
期刊
2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1