{"title":"面向电信级通用节点","authors":"T. Korikawa, Akio Kawabata, A. Masuda","doi":"10.1109/ICCNC.2017.7876185","DOIUrl":null,"url":null,"abstract":"In this paper, we elucidate the main performance bottleneck in realizing packet processing which require carrier-scale huge tables to lookup (carrier-scale packet processing) on top of general-purpose servers. Our experimental quantitative analysis using DPDK-Click based packet processing model reveals that performance degradation of the carrier-scale packet processing is mainly caused by increased CPU cache misses with increasing number of input traffic flows. This indicates that the main performance bottleneck of carrier-scale packet processing using general-purpose hardware is the lack of memory response performance. As a solution to the bottleneck, we propose novel server architecture that utilizes Hybrid Memory Cube (HMC), a sort of 3D-stacked DRAM, to reinforce memory response performance in terms of concurrency. Our simulation based evaluation shows that our proposed server architecture can realize more than 40 Gbps carrier-scale packet processing without any dedicated hardware. Our work shows that augmented concurrency of memory response is as important as parallel processing at multi-core CPU to improve the system level performance of carrier-scale packet processing using general-purpose hardware.","PeriodicalId":135028,"journal":{"name":"2017 International Conference on Computing, Networking and Communications (ICNC)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Toward carrier-scale general-purpose node\",\"authors\":\"T. Korikawa, Akio Kawabata, A. Masuda\",\"doi\":\"10.1109/ICCNC.2017.7876185\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we elucidate the main performance bottleneck in realizing packet processing which require carrier-scale huge tables to lookup (carrier-scale packet processing) on top of general-purpose servers. Our experimental quantitative analysis using DPDK-Click based packet processing model reveals that performance degradation of the carrier-scale packet processing is mainly caused by increased CPU cache misses with increasing number of input traffic flows. This indicates that the main performance bottleneck of carrier-scale packet processing using general-purpose hardware is the lack of memory response performance. As a solution to the bottleneck, we propose novel server architecture that utilizes Hybrid Memory Cube (HMC), a sort of 3D-stacked DRAM, to reinforce memory response performance in terms of concurrency. Our simulation based evaluation shows that our proposed server architecture can realize more than 40 Gbps carrier-scale packet processing without any dedicated hardware. Our work shows that augmented concurrency of memory response is as important as parallel processing at multi-core CPU to improve the system level performance of carrier-scale packet processing using general-purpose hardware.\",\"PeriodicalId\":135028,\"journal\":{\"name\":\"2017 International Conference on Computing, Networking and Communications (ICNC)\",\"volume\":\"35 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 International Conference on Computing, Networking and Communications (ICNC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCNC.2017.7876185\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 International Conference on Computing, Networking and Communications (ICNC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCNC.2017.7876185","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In this paper, we elucidate the main performance bottleneck in realizing packet processing which require carrier-scale huge tables to lookup (carrier-scale packet processing) on top of general-purpose servers. Our experimental quantitative analysis using DPDK-Click based packet processing model reveals that performance degradation of the carrier-scale packet processing is mainly caused by increased CPU cache misses with increasing number of input traffic flows. This indicates that the main performance bottleneck of carrier-scale packet processing using general-purpose hardware is the lack of memory response performance. As a solution to the bottleneck, we propose novel server architecture that utilizes Hybrid Memory Cube (HMC), a sort of 3D-stacked DRAM, to reinforce memory response performance in terms of concurrency. Our simulation based evaluation shows that our proposed server architecture can realize more than 40 Gbps carrier-scale packet processing without any dedicated hardware. Our work shows that augmented concurrency of memory response is as important as parallel processing at multi-core CPU to improve the system level performance of carrier-scale packet processing using general-purpose hardware.