2011 IEEE 27th Symposium on Mass Storage Systems and Technologies (MSST): Latest Publications

Rejuvenator: A static wear leveling algorithm for NAND flash memory with minimized overhead
Pub Date : 2011-05-23 DOI: 10.1109/MSST.2011.5937225
M. Murugan, D. Du
NAND flash memory is fast replacing traditional magnetic storage media due to its better performance and low power requirements. However, the endurance of flash memory is still a critical issue in using it for large-scale enterprise applications. Rethinking the basic design of NAND flash memory is essential to realize its maximum potential in large-scale storage. NAND flash memory is organized as blocks, and blocks in turn consist of pages. A block can be erased reliably only a limited number of times, and frequent erase operations concentrated on a few blocks reduce the lifetime of the flash memory. Wear leveling helps to prevent the early wear-out of blocks in the flash memory. To achieve efficient wear leveling, data is moved around throughout the flash memory. The existing wear leveling algorithms do not scale to large NAND flash-based SSDs. In this paper we propose a static wear leveling algorithm, named Rejuvenator, for large-scale NAND flash memory. Rejuvenator adapts to changes in workloads and minimizes the cost of expensive data migrations. Our evaluation of Rejuvenator is based on detailed simulations with large-scale enterprise workloads and synthetic micro-benchmarks.
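The abstract above does not give Rejuvenator's actual procedure, so the following is only a minimal sketch of generic static wear leveling, not the authors' algorithm. It assumes each block tracks an erase count and that cold data is migrated once the gap between a worn free block and a lightly erased data block exceeds a threshold. The names `Block`, `erase`, `static_wear_level`, and `threshold` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Block:
    erase_count: int = 0
    data: Optional[str] = None  # None means the block holds no valid data


def erase(block: Block) -> None:
    """Erase a block; every erase uses up part of its limited lifetime."""
    block.data = None
    block.erase_count += 1


def static_wear_level(blocks: List[Block], threshold: int) -> bool:
    """Migrate cold data when the erase-count gap exceeds `threshold`.

    Returns True when a migration was performed.
    """
    used = [b for b in blocks if b.data is not None]
    free = [b for b in blocks if b.data is None]
    if not used or not free:
        return False
    coldest = min(used, key=lambda b: b.erase_count)    # likely holds static data
    most_worn = max(free, key=lambda b: b.erase_count)  # worn out by hot writes
    if most_worn.erase_count - coldest.erase_count <= threshold:
        return False
    # Park the cold data on the worn block, then erase the lightly used
    # block so that future hot writes land on it instead.
    most_worn.data = coldest.data
    erase(coldest)
    return True


blocks = [Block(erase_count=1, data="cold"),
          Block(erase_count=50),
          Block(erase_count=10, data="warm")]
migrated = static_wear_level(blocks, threshold=20)
```

Each migration costs an extra copy and an extra erase, which is the overhead the abstract says should be minimized; a larger `threshold` trades evenness of wear for fewer migrations.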
Citations: 150
Semi-RAID: A reliable energy-aware RAID data layout for sequential data access
Pub Date : 2011-05-23 DOI: 10.1109/MSST.2011.5937222
Li Xiao, Yufen Tan, Zhizhuo Sun
In this paper, we propose an alternative RAID data layout, Semi-RAID (S-RAID), for the kind of application that exhibits a sequential data access pattern. The data layout of S-RAID uses a grouping strategy that makes only part of the whole array active and puts the rest of the array into standby mode. In this way S-RAID can greatly reduce the power consumption and improve the reliability while still satisfying the I/O requirements of the application. We have shown that transforming a traditional RAID 5 into S-RAID 5 makes average mean time to failure (MTTF) 8 times as long as before, and reduces the power consumption to 74% in a 5-disk configuration. In a typical video surveillance system that has 25 disks, S-RAID may reduce the power consumption to nearly 15% in the best case. The performance of S-RAID is evaluated and proved to be sufficient for the workloads it is optimized for. Possible optimization for other workloads has also been given at the end of the paper.
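The grouping idea in the abstract can be illustrated with a small sketch. It assumes, hypothetically, that each disk group owns one contiguous range of logical block addresses, so a sequential stream touches one group at a time, and it uses a deliberately crude power model in which a standby disk draws a fixed fraction of an active disk's power. Neither assumption comes from the paper, and the sketch does not attempt to reproduce the 74% or 15% figures. `group_of`, `relative_power`, and their parameters are illustrative names.

```python
def group_of(lba: int, blocks_per_group: int) -> int:
    """Return the index of the disk group that stores logical block `lba`."""
    if blocks_per_group <= 0:
        raise ValueError("blocks_per_group must be positive")
    return lba // blocks_per_group


def relative_power(total_disks: int, active_disks: int, standby_ratio: float) -> float:
    """Array power as a fraction of the power drawn with every disk active.

    `standby_ratio` is a standby disk's power divided by an active disk's power.
    """
    if not 0 <= active_disks <= total_disks:
        raise ValueError("active_disks must be between 0 and total_disks")
    standby_disks = total_disks - active_disks
    return (active_disks + standby_disks * standby_ratio) / total_disks


# A sequential stream of 1000 blocks stays inside a single group,
# so every other group could remain in standby while it is written.
touched = {group_of(lba, blocks_per_group=4096) for lba in range(1000)}
```

Under this model the saving grows with the number of disks that can stay in standby, which is consistent with the abstract reporting a larger reduction for the 25-disk system than for the 5-disk one.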
Citations: 12
A Forest-structured Bloom Filter with flash memory
Pub Date : 2011-05-23 DOI: 10.1109/MSST.2011.5937232
Guanlin Lu, Biplob K. Debnath, D. Du
A Bloom Filter (BF) is a probabilistic data structure that compactly represents a set of elements (keys). It is widely used to identify efficiently whether a key has been seen before while using a minimal amount of space. BFs are heavily used in chunking-based data de-duplication. Traditionally, a BF is implemented as an in-RAM data structure; hence its size is limited by the available RAM on the machine. For certain applications, such as data de-duplication, that require a BF larger than the available RAM, it becomes necessary to store the BF on a secondary storage device. Since BF operations are inherently random, and magnetic disks perform poorly on random reads and writes, a disk is not a good fit for storing a large BF. Flash memory based Solid State Drives (SSDs) are considered an emerging storage technology with superior performance that can potentially replace disks as the preferred secondary storage devices. However, several special characteristics of flash memory make designing a flash memory based BF very challenging. In this paper, our goal is to design an efficient flash memory based BF that is fully aware of these physical characteristics. To this end, we propose a Forest-structured BF design (FBF). FBF uses a combination of RAM and flash memory: the BF is stored on flash, while RAM helps to mitigate the impact of the slow write performance of flash memory. In addition, the in-flash BF is organized in a forest-like structure to improve lookup performance. Our experimental results show that the FBF design achieves 2 times faster processing speed with 50% fewer flash write operations than existing flash memory based BF designs.
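To make the access pattern concrete, here is a minimal sketch of a plain in-RAM Bloom filter. It is not the forest-structured FBF design, whose layout the abstract does not detail. The class name, the use of SHA-256, and the double-hashing scheme for deriving bit positions are all assumptions chosen for illustration.

```python
import hashlib
from typing import Iterator


class BloomFilter:
    def __init__(self, num_bits: int, num_hashes: int) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key: bytes) -> Iterator[int]:
        # Derive k bit positions by double hashing two slices of one digest.
        digest = hashlib.sha256(key).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # odd step size
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, key: bytes) -> None:
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key: bytes) -> bool:
        # False means definitely absent; True may be a false positive.
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(key))


bf = BloomFilter(num_bits=1024, num_hashes=4)
seen_before = bf.might_contain(b"chunk-1")
bf.add(b"chunk-1")
seen_after = bf.might_contain(b"chunk-1")
```

Every `add` or lookup touches several bit positions scattered across the whole array, which is the random access pattern the abstract identifies as a poor fit for magnetic disks.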
Citations: 26
Design and evaluation of Oasis: An active storage framework based on T10 OSD standard
Pub Date : 2011-05-23 DOI: 10.1109/MSST.2011.5937220
Yulai Xie, Kiran-Kumar Muniswamy-Reddy, D. Feng, D. Long, Yangwook Kang, Zhongying Niu, Zhipeng Tan
In this paper, we present the design and performance evaluation of Oasis, an active storage framework for object-based storage systems that complies with the current T10 OSD standard. In contrast with previous work, Oasis has the following advantages. First, Oasis enables users to transparently process the OSD object and supports different processing granularity (from the single object to all the objects in the OSD) by extending the OSD object attribute page defined in the T10 OSD standard. Second, Oasis provides an easy and efficient way for users to manage the application functions in the OSD by using the existing OSD commands. Third, Oasis can authorize the execution of the application function in the OSD by enhancing the T10 OSD security protocol, allowing only authorized users to use the system. We evaluate the performance and scalability of our system implementation on Oasis by running three typical applications. The results indicate that active storage far outperforms the traditional object-based storage system in applications that filter data on the OSD. We also experiment with Java based applications and C based applications. Our experiments indicate that Java based applications may be bottlenecked for I/O-intensive applications, while for applications that do not heavily rely on I/O operations, both Java based applications and C based applications achieve comparable performance. Our microbenchmarks indicate that Oasis implementation overhead is minimal compared to the Intel OSD reference implementation, between 1.2% and 5.9% for Read commands and between 0.6% and 9.9% for Write commands.
Citations: 63
RAID6L: A log-assisted RAID6 storage architecture with improved write performance
Pub Date : 2011-05-23 DOI: 10.1109/MSST.2011.5937230
Chao Jin, D. Feng, Hong Jiang, Lei Tian
The RAID6 architecture is playing an increasingly important role in modern storage systems due to its provision of very high reliability. However, its high write penalty, because of the double-parity-update overheads upon each write operation, has been a persistent performance bottleneck of the RAID6 systems. In this paper, we propose a log-assisted RAID6 architecture, called RAID6L, to boost the write performance of the RAID6 systems. RAID6L integrates a log disk into the traditional RAID6 architecture, and alleviates its write penalty by simplifying the processing steps to service a write request. On the other hand, RAID6L also guarantees that the accelerated RAID6 systems can still recover from double disk failures. The Parity Logging scheme was originally proposed to accelerate the XOR based RAID5, and we propose a method to make it also applicable to the Reed-Solomon based RAID6. We present a detailed comparison between RAID6L and Parity Logging, and show that RAID6L has several advantages over Parity Logging. Experimental results show that RAID6L significantly increases the data transfer rate and decreases the request response time when compared with the traditional RAID6 and Parity Logging systems.
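The double-parity-update overhead mentioned in the abstract can be seen in a small sketch of a Reed-Solomon style P/Q update. This is the conventional read-modify-write path, not the RAID6L log-assisted scheme. The field polynomial 0x11D and generator 2 are a common convention and are assumed here, not taken from the paper; each "disk" holds a single byte for brevity, and all function names are illustrative.

```python
from typing import List, Tuple


def gf_mul(a: int, b: int) -> int:
    """Multiply two bytes in GF(2^8), reducing by the polynomial 0x11D."""
    result = 0
    while b:
        if b & 1:
            result ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11D
        b >>= 1
    return result


def coefficient(index: int) -> int:
    """Return 2**index in GF(2^8), the Q coefficient for data disk `index`."""
    value = 1
    for _ in range(index):
        value = gf_mul(value, 2)
    return value


def full_parity(data: List[int]) -> Tuple[int, int]:
    """Compute P (plain XOR) and Q (weighted XOR) over a whole stripe."""
    p = q = 0
    for i, d in enumerate(data):
        p ^= d
        q ^= gf_mul(coefficient(i), d)
    return p, q


def small_write(data: List[int], p: int, q: int,
                index: int, new_value: int) -> Tuple[int, int]:
    """Update one data byte and both parities.

    On real disks this costs three reads (old data, P, Q)
    and three writes (new data, P, Q).
    """
    delta = data[index] ^ new_value
    data[index] = new_value
    return p ^ delta, q ^ gf_mul(coefficient(index), delta)


stripe = [0x12, 0x34, 0x56, 0x78]
p, q = full_parity(stripe)
p_new, q_new = small_write(stripe, p, q, index=2, new_value=0xAB)
```

The incremental update yields the same parities as recomputing the stripe from scratch, but the six disk operations per small write are the penalty that a log disk is meant to reduce.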
Citations: 29
AoE storage protocol over MPLS network
Pub Date : 2011-05-23 DOI: 10.1109/MSST.2011.5937231
M. Landowski, P. Curran
ATA over Ethernet (AoE) protocol is an interesting alternative to iSCSI and Fibre Channel. AoE is a light, layer 2 protocol integrated with Ethernet frames, which makes it ideal for work inside LAN segments. Unfortunately, this advantage is also its limitation when access to the AoE storage is required to be over the internetwork. In this paper we show how MPLS can make AoE routable and thereby also independent of Ethernet itself.
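The abstract does not spell out the encapsulation, so the sketch below shows only one plausible arrangement: a complete AoE Ethernet frame carried as the payload behind a single MPLS label stack entry. The 32-bit entry layout (20-bit label, 3-bit traffic class, bottom-of-stack bit, 8-bit TTL) follows RFC 3032, and 0x88A2 is the EtherType registered for AoE. The function names, the label value, and the placeholder frame are hypothetical, and the outer link-layer header is omitted.

```python
import struct

AOE_ETHERTYPE = 0x88A2  # EtherType registered for ATA over Ethernet


def mpls_entry(label: int, tc: int = 0, bottom: bool = True, ttl: int = 64) -> bytes:
    """Pack one 32-bit MPLS label stack entry in network byte order."""
    if not 0 <= label < (1 << 20):
        raise ValueError("label must fit in 20 bits")
    if not 0 <= tc < 8:
        raise ValueError("traffic class must fit in 3 bits")
    if not 0 <= ttl < 256:
        raise ValueError("ttl must fit in 8 bits")
    word = (label << 12) | (tc << 9) | (int(bottom) << 8) | ttl
    return struct.pack("!I", word)


def encapsulate(aoe_frame: bytes, label: int) -> bytes:
    """Prefix a complete AoE Ethernet frame with a single MPLS label."""
    return mpls_entry(label) + aoe_frame


# Placeholder frame: zeroed MAC addresses, the AoE EtherType, empty payload.
frame = bytes(12) + struct.pack("!H", AOE_ETHERTYPE)
packet = encapsulate(frame, label=16)
```

Because forwarding decisions use only the label, the AoE frame inside no longer has to stay within one LAN segment, which is the routability the abstract describes.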
Citations: 1
Understanding and improving computational science storage access through continuous characterization
Pub Date : 2011-05-23 DOI: 10.1145/2027066.2027068
P. Carns, K. Harms, W. Allcock, Charles Bacon, S. Lang, R. Latham, R. Ross
Computational science applications are driving a demand for increasingly powerful storage systems. While many techniques are available for capturing the I/O behavior of individual application trial runs and specific components of the storage system, continuous characterization of a production system remains a daunting challenge for systems with hundreds of thousands of compute cores and multiple petabytes of storage. As a result, these storage systems are often designed without a clear understanding of the diverse computational science workloads they will support.
Citations: 253
DBLK: Deduplication for primary block storage
Pub Date : 2011-05-23 DOI: 10.1109/MSST.2011.5937237
Y. Tsuchiya, Takashi Watanabe
The deduplication block-device (DBLK) is a deduplication and compression system with a block device interface. It is used as primary storage, and block-wise deduplication is done inline. Since deduplication for primary storage requires low latency and block-wise duplicate detection creates a large amount of metadata, the system's memory must be used efficiently. We solved this problem by developing a multilayer Bloom filter (MBF) that reduces the size of the in-memory data structure used for indexing duplicate data.
Citations: 50
Performance models of flash-based solid-state drives for real workloads
Pub Date : 2011-05-23 DOI: 10.1109/MSST.2011.5937227
Simona Boboila, Peter Desnoyers
There is a wide gap between the potential performance of NAND flash-based solid state drives (SSDs) and their performance in many real-world applications; understanding this gap requires knowledge of their behavior and internal algorithms for various workloads. We develop analytic models for two commonly-used Flash Translation Layer (FTL) algorithms, as used in SSDs, as well as a methodology for applying these models to real-world workloads. We demonstrate the accuracy of these models via simulation, extend this approach to incorporate measurement-based approximations when detailed parameters are unknown, and validate this methodology against real devices.
Citations: 31
Using XML and XQuery for data management in HPSS
Pub Date : 2011-05-23 DOI: 10.1109/MSST.2011.5937233
Michael Meseke
The XML and XQuery language capabilities of modern databases can provide a powerful and flexible method of data management within a mass storage system. Within the High Performance Storage System (HPSS), the implementation of XML and XQuery capabilities for user metadata is called “User-defined Attributes” or UDA. The UDA feature provides a method for user applications to associate arbitrary metadata with HPSS namespace objects and store it in an organized, scalable, and searchable manner using XML. The implementation includes a simple key-value interface as well as exposure of the database's XQuery interface to allow for highly customized and atomic update, retrieval, and namespace-wide search requests. Using this architecture enables HPSS to provide client applications a high degree of flexibility in the storage, management, and access of user-defined metadata. This paper describes a brief history of data management within HPSS as well as the architectural decisions, implementation, and results of the UDA feature. Also discussed are considerations for planning and management of the UDA feature, current and in development UDA solutions created by developers and customers, and possible future data management work within HPSS.
Citations: 1
Journal
2011 IEEE 27th Symposium on Mass Storage Systems and Technologies (MSST)