{"title":"FGLB: A fine-grained hardware intra-server load balancer based on 100 G FPGA SmartNIC","authors":"Xiaoying Huang, Zhichuan Guo, Mangu Song","doi":"10.1002/nem.2211","DOIUrl":null,"url":null,"abstract":"<p>In today's data centers, workloads including multiple services and requests are processed in parallel within a server with many CPU cores. Therefore, meeting the intra-server load balancing is very important to improve the utilization of CPU resources in the data centers. However, the existing methods cannot well meet the intra-server load balancing in high-throughput scenarios. The software-based methods generally utilize CPU cores to parse and dispatch packets. They work well at low throughput, but they have high CPU overhead at high throughput, leading to packet loss and high latency issues. The hardware-based methods parse the packet and compute a hash over its metadata in hardware and perform load balancing in a coarse-grained manner based on the hash value. They have the ability to work at high throughput with the advantage of low overhead but are less well in balance effect and flexibility. We, therefore, propose an intra-server load balancer based on the reconfigurable hardware, FPGA, to meet the requirements for load balancing within servers in high-speed application scenarios. Our method improves the load-balancing gran- ularity of hardware-based method. It not only has high throughput but also has a good balance effect and flexibility. We implemented and evaluated our method on a 100 G FPGA SmartNIC. The evaluation result shows our method can reduce the load imbalance ratio by an order of magnitude when the distribution of flow size is uneven compared to the current widely used hardware-based method.</p>","PeriodicalId":14154,"journal":{"name":"International Journal of Network Management","volume":"32 6","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2022-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Network Management","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/nem.2211","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 1
Abstract
In today's data centers, workloads including multiple services and requests are processed in parallel within a server with many CPU cores. Therefore, meeting the intra-server load balancing is very important to improve the utilization of CPU resources in the data centers. However, the existing methods cannot well meet the intra-server load balancing in high-throughput scenarios. The software-based methods generally utilize CPU cores to parse and dispatch packets. They work well at low throughput, but they have high CPU overhead at high throughput, leading to packet loss and high latency issues. The hardware-based methods parse the packet and compute a hash over its metadata in hardware and perform load balancing in a coarse-grained manner based on the hash value. They have the ability to work at high throughput with the advantage of low overhead but are less well in balance effect and flexibility. We, therefore, propose an intra-server load balancer based on the reconfigurable hardware, FPGA, to meet the requirements for load balancing within servers in high-speed application scenarios. Our method improves the load-balancing gran- ularity of hardware-based method. It not only has high throughput but also has a good balance effect and flexibility. We implemented and evaluated our method on a 100 G FPGA SmartNIC. The evaluation result shows our method can reduce the load imbalance ratio by an order of magnitude when the distribution of flow size is uneven compared to the current widely used hardware-based method.
期刊介绍:
Modern computer networks and communication systems are increasing in size, scope, and heterogeneity. The promise of a single end-to-end technology has not been realized and likely never will occur. The decreasing cost of bandwidth is increasing the possible applications of computer networks and communication systems to entirely new domains. Problems in integrating heterogeneous wired and wireless technologies, ensuring security and quality of service, and reliably operating large-scale systems including the inclusion of cloud computing have all emerged as important topics. The one constant is the need for network management. Challenges in network management have never been greater than they are today. The International Journal of Network Management is the forum for researchers, developers, and practitioners in network management to present their work to an international audience. The journal is dedicated to the dissemination of information, which will enable improved management, operation, and maintenance of computer networks and communication systems. The journal is peer reviewed and publishes original papers (both theoretical and experimental) by leading researchers, practitioners, and consultants from universities, research laboratories, and companies around the world. Issues with thematic or guest-edited special topics typically occur several times per year. Topic areas for the journal are largely defined by the taxonomy for network and service management developed by IFIP WG6.6, together with IEEE-CNOM, the IRTF-NMRG and the Emanics Network of Excellence.