When NVMe over Fabrics Meets Arm: Performance and Implications

2019 35th Symposium on Mass Storage Systems and Technologies (MSST) Pub Date : 2019-05-20 DOI:10.1109/MSST.2019.000-9

Yichen Jia, E. Anger, Feng Chen

{"title":"When NVMe over Fabrics Meets Arm: Performance and Implications","authors":"Yichen Jia, E. Anger, Feng Chen","doi":"10.1109/MSST.2019.000-9","DOIUrl":null,"url":null,"abstract":"A growing technology trend in the industry is to deploy highly capable and power-efficient storage servers based on the Arm architecture. An important driving force behind this is storage disaggregation, which separates compute and storage to different servers, enabling independent resource allocation and optimized hardware utilization. The recently released remote storage protocol specification, NVMe-over-Fabrics (NVMeoF), makes flash disaggregation possible by reducing the remote access overhead to the minimum. It is highly appealing to integrate the two promising technologies together to build an efficient Arm based storage server with NVMeoF. In this work, we have conducted a set of comprehensive experiments to understand the performance behaviors of NVMeoF on Arm-based Data Center SoC and to gain insight into the implications of their design and deployment in data centers. Our experiments show that NVMeoF delivers the promised ultra-low latency. With appropriate optimizations on both hardware and software, NVMeoF can achieve even better performance than direct attached storage. Specifically, with appropriate NIC optimizations, we have observed a throughput increase by up to 42.5% and a decrease of the 95th percentile tail latency by up to 14.6%. Based on our measurement results, we also discuss several system implications for integrating NVMeoF on Arm based platforms. Our studies show that this system solution can well balance the computation, network, and storage resources for data-center storage services. Our findings have also been reported to Arm and Broadcom for future optimizations.","PeriodicalId":391517,"journal":{"name":"2019 35th Symposium on Mass Storage Systems and Technologies (MSST)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-05-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 35th Symposium on Mass Storage Systems and Technologies (MSST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MSST.2019.000-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

A growing technology trend in the industry is to deploy highly capable and power-efficient storage servers based on the Arm architecture. An important driving force behind this is storage disaggregation, which separates compute and storage to different servers, enabling independent resource allocation and optimized hardware utilization. The recently released remote storage protocol specification, NVMe-over-Fabrics (NVMeoF), makes flash disaggregation possible by reducing the remote access overhead to the minimum. It is highly appealing to integrate the two promising technologies together to build an efficient Arm based storage server with NVMeoF. In this work, we have conducted a set of comprehensive experiments to understand the performance behaviors of NVMeoF on Arm-based Data Center SoC and to gain insight into the implications of their design and deployment in data centers. Our experiments show that NVMeoF delivers the promised ultra-low latency. With appropriate optimizations on both hardware and software, NVMeoF can achieve even better performance than direct attached storage. Specifically, with appropriate NIC optimizations, we have observed a throughput increase by up to 42.5% and a decrease of the 95th percentile tail latency by up to 14.6%. Based on our measurement results, we also discuss several system implications for integrating NVMeoF on Arm based platforms. Our studies show that this system solution can well balance the computation, network, and storage resources for data-center storage services. Our findings have also been reported to Arm and Broadcom for future optimizations.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

当NVMe在fabric上遇到Arm:性能和影响

基于Arm架构部署高性能、高能效的存储服务器是业界不断发展的技术趋势。这背后的一个重要驱动力是存储分解，它将计算和存储分离到不同的服务器，从而实现独立的资源分配和优化的硬件利用率。最近发布的远程存储协议规范NVMe-over-Fabrics (NVMeoF)通过将远程访问开销降至最低，使闪存分解成为可能。将这两种有前途的技术集成在一起，构建一个基于Arm的高效NVMeoF存储服务器是非常有吸引力的。在这项工作中，我们进行了一组全面的实验，以了解NVMeoF在基于arm的数据中心SoC上的性能行为，并深入了解其设计和部署在数据中心中的影响。我们的实验表明，NVMeoF提供了承诺的超低延迟。通过对硬件和软件进行适当的优化，NVMeoF可以实现比直接连接存储更好的性能。具体来说，通过适当的NIC优化，我们观察到吞吐量增加了42.5%，第95百分位尾部延迟减少了14.6%。基于我们的测量结果，我们还讨论了在基于Arm的平台上集成NVMeoF的几个系统含义。研究表明，该方案能够很好地平衡数据中心存储业务的计算资源、网络资源和存储资源。我们的研究结果也已报告给Arm和Broadcom，以便将来进行优化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 35th Symposium on Mass Storage Systems and Technologies (MSST)

自引率

0.00%

发文量