Exporting kernel page caching for efficient user-level I/O

Richard P. Spillane, S. Dixit, Shrikar Archak, Saumitra Bhanage, E. Zadok
DOI: 10.1109/MSST.2010.5496973
Published in: 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST)
Publication date: 2010-05-03
Citations: 4

Abstract

The modern file system is still implemented in the kernel, and is statically linked with other kernel components. This architecture has brought performance and efficient integration with memory management. However, kernel development is slow, and modern storage systems must support an array of features, including distribution across a network, tagging, searching, deduplication, checksumming, snapshotting, file preallocation, real-time I/O guarantees for media, and more. Moving complex components to the user level, however, requires an efficient mechanism for handling page faulting and zero-copy caching, write ordering, synchronous flushes, interaction with the kernel page write-back thread, and secure shared memory. We implement such a system, and experiment with a user-level object store built on top of it. Our object store is a complete redesign of the traditional storage stack; it demonstrates the efficiency of our technique and the flexibility it grants to user-level storage systems. Our current prototype file system incurs between a 1% and 6% overhead relative to the default native file system, Ext3, for in-cache system workloads, where the native kernel file-system design has traditionally found its primary motivation. For update- and insert-intensive metadata workloads that are out of cache, we perform 39 times better than the native Ext3 file system, while performing only 2 times worse on out-of-cache random lookups.
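The core mechanism the abstract alludes to, letting a user-level store read and write pages that live in the kernel page cache, with an explicit synchronous-flush point, can be approximated with a plain shared memory mapping. The sketch below uses standard `mmap` as a stand-in; the paper's actual interface (fault upcalls, write ordering, secure shared memory) is richer than this, and the file name is purely illustrative.

```python
# Minimal sketch: a user-level process writing through a shared mapping.
# With a shared mapping, stores land directly in the kernel page cache
# (no extra user-space copy), and flush() gives a synchronous-flush
# point analogous to msync(..., MS_SYNC).
import mmap
import os

path = "cache_demo.dat"  # hypothetical backing file for one cached object
with open(path, "w+b") as f:
    f.truncate(4096)  # reserve one page
    with mmap.mmap(f.fileno(), 4096, access=mmap.ACCESS_WRITE) as page:
        page[:9] = b"object-42"  # write through the shared mapping
        page.flush()             # force the dirty page to stable storage
        data = bytes(page[:9])
os.remove(path)
```

What plain `mmap` does not provide, and what the paper's mechanism has to add, is control over *when* the kernel's write-back thread flushes dirty pages and a way for the kernel to notify the user-level store on page faults, which is why a dedicated kernel export is needed rather than this stand-in.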