Receiver-Driven RDMA Congestion Control by Differentiating Congestion Types in Datacenter Networks

Jiao Zhang, Jiaming Shi, Xiaolong Zhong, Zirui Wan, Yuxing Tian, Tian Pan, Tao Huang
{"title":"Receiver-Driven RDMA Congestion Control by Differentiating Congestion Types in Datacenter Networks","authors":"Jiao Zhang, Jiaming Shi, Xiaolong Zhong, Zirui Wan, Yuxing Tian, Tian Pan, Tao Huang","doi":"10.1109/ICNP52444.2021.9651938","DOIUrl":null,"url":null,"abstract":"The development of datacenter applications leads to the need for end-to-end communication with microsecond latency. As a result, RDMA is becoming prevalent in datacenter networks to mitigate the latency caused by the slow processing speed of the traditional software network stack. However, existing RDMA congestion control mechanisms are either far from optimal in simultaneously achieving high throughput and low latency or in need of additional in-network function support. In this paper, by leveraging the observation that most congestion occurs at the last hop in datacenter networks, we propose RCC, a receiver-driven rapid congestion control mechanism for RDMA networks that combines explicit assignment and iterative window adjustment. Firstly, we propose a network congestion distinguish method to classify congestions into two types, last-hop congestion and innetwork congestion. Then, an Explicit Window Assignment mechanism is proposed to solve the last-hop congestion, which enables senders to converge to a proper sending rate in one-RTT. For in-network congestion, a PID-based iterative delay-based window adjustment scheme is proposed to achieve fast convergence and near-zero queuing latency. RCC does not need additional innetwork support and is friendly to hardware implementation. In our evaluation, the overall average FCT (Flow Completion Time) of RCC is 4~79% better than Homa, ExpressPass, DCQCN, TIMELY, and HPCC.","PeriodicalId":343813,"journal":{"name":"2021 IEEE 29th International Conference on Network Protocols (ICNP)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 29th International Conference on Network Protocols (ICNP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNP52444.2021.9651938","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The development of datacenter applications leads to the need for end-to-end communication with microsecond latency. As a result, RDMA is becoming prevalent in datacenter networks to mitigate the latency caused by the slow processing speed of the traditional software network stack. However, existing RDMA congestion control mechanisms are either far from optimal in simultaneously achieving high throughput and low latency or in need of additional in-network function support. In this paper, by leveraging the observation that most congestion occurs at the last hop in datacenter networks, we propose RCC, a receiver-driven rapid congestion control mechanism for RDMA networks that combines explicit assignment and iterative window adjustment. Firstly, we propose a network congestion distinguish method to classify congestions into two types, last-hop congestion and innetwork congestion. Then, an Explicit Window Assignment mechanism is proposed to solve the last-hop congestion, which enables senders to converge to a proper sending rate in one-RTT. For in-network congestion, a PID-based iterative delay-based window adjustment scheme is proposed to achieve fast convergence and near-zero queuing latency. RCC does not need additional innetwork support and is friendly to hardware implementation. In our evaluation, the overall average FCT (Flow Completion Time) of RCC is 4~79% better than Homa, ExpressPass, DCQCN, TIMELY, and HPCC.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过区分数据中心网络中的拥塞类型来控制接收端驱动的RDMA拥塞
数据中心应用程序的开发导致需要具有微秒级延迟的端到端通信。因此,RDMA在数据中心网络中变得越来越普遍,以减轻传统软件网络堆栈处理速度慢所造成的延迟。然而,现有的RDMA拥塞控制机制在同时实现高吞吐量和低延迟方面远非最佳,或者需要额外的网络内功能支持。在本文中,通过观察大多数拥塞发生在数据中心网络的最后一跳,我们提出了RCC,一种接收器驱动的RDMA网络快速拥塞控制机制,结合了显式分配和迭代窗口调整。首先,提出了一种网络拥塞判别方法,将拥塞分为最后一跳拥塞和网络内拥塞两种类型。然后,提出了一种显式窗口分配机制来解决最后一跳拥塞问题,使发送端能够在一次rtt中收敛到合适的发送速率。针对网络内拥塞问题,提出了一种基于pid的迭代延迟窗口调整方案,实现了快速收敛和接近于零的排队延迟。RCC不需要额外的网络支持,并且对硬件实现很友好。在我们的评估中,RCC的总体平均FCT (Flow Completion Time)比Homa、ExpressPass、DCQCN、TIMELY和HPCC好4~79%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Exploiting WiFi AP for Simultaneous Data Dissemination among WiFi and ZigBee Devices Highway On-Ramp Merging for Mixed Traffic: Recent Advances and Future Trends Generalizable and Interpretable Deep Learning for Network Congestion Prediction DNSonChain: Delegating Privacy-Preserved DNS Resolution to Blockchain ISP Self-Operated BGP Anomaly Detection Based on Weakly Supervised Learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1