首页 > 最新文献

Proceedings of the 2021 ACM SIGCOMM 2021 Conference最新文献

英文 中文
revisiting the open vSwitch dataplane ten years later 十年后重新访问开放的vSwitch数据平面
Pub Date : 2021-08-09 DOI: 10.1145/3452296.3472914
William Tu, Yingying Wei, G. Antichi, Ben Pfaff
This paper shares our experience in supporting and running the Open vSwitch (OVS) software switch, as part of the NSX product for enterprise data center virtualization used by thousands of VMware customers. Starting in 2009, the OVS design split its code between tightly coupled kernel and userspace components. This split was necessary at the time for performance, but it caused maintainability problems that persist today. In addition, in-kernel packet processing is now much slower than newer options. To solve the problems caused by the user/kernel split, OVS must adopt a new architecture. We describe two possibilities that we explored, but did not adopt, one because it gives up compatibility with drivers and tools that are important to virtual data center operators, the other because it performs poorly. Instead, we endorse a third approach, based on a new Linux socket type called AF_XDP, which solves the maintainability problem in a compatible, performant way. The new code is already merged into the mainstream OVS repository. We include a thorough performance evaluation and a collection of lessons learned.
本文分享了我们在支持和运行Open vSwitch (OVS)软件交换机方面的经验,OVS是用于企业数据中心虚拟化的NSX产品的一部分,被成千上万的VMware客户使用。从2009年开始,OVS设计将其代码拆分为紧密耦合的内核和用户空间组件。这种分割在当时对于性能来说是必要的,但是它导致了今天仍然存在的可维护性问题。此外,内核内包处理现在比新选项慢得多。为了解决由用户/内核分裂引起的问题,OVS必须采用一种新的体系结构。我们描述了两种可能性,一种是因为它放弃了对虚拟数据中心运营商很重要的驱动程序和工具的兼容性,另一种是因为它的性能很差。相反,我们支持第三种方法,它基于一种名为AF_XDP的新的Linux套接字类型,它以一种兼容的、高性能的方式解决了可维护性问题。新代码已经合并到主流OVS存储库中。我们包括全面的绩效评估和经验教训的收集。
{"title":"revisiting the open vSwitch dataplane ten years later","authors":"William Tu, Yingying Wei, G. Antichi, Ben Pfaff","doi":"10.1145/3452296.3472914","DOIUrl":"https://doi.org/10.1145/3452296.3472914","url":null,"abstract":"This paper shares our experience in supporting and running the Open vSwitch (OVS) software switch, as part of the NSX product for enterprise data center virtualization used by thousands of VMware customers. Starting in 2009, the OVS design split its code between tightly coupled kernel and userspace components. This split was necessary at the time for performance, but it caused maintainability problems that persist today. In addition, in-kernel packet processing is now much slower than newer options. To solve the problems caused by the user/kernel split, OVS must adopt a new architecture. We describe two possibilities that we explored, but did not adopt, one because it gives up compatibility with drivers and tools that are important to virtual data center operators, the other because it performs poorly. Instead, we endorse a third approach, based on a new Linux socket type called AF_XDP, which solves the maintainability problem in a compatible, performant way. The new code is already merged into the mainstream OVS repository. We include a thorough performance evaluation and a collection of lessons learned.","PeriodicalId":20487,"journal":{"name":"Proceedings of the 2021 ACM SIGCOMM 2021 Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89686764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
RoS: passive smart surface for roadside-to-vehicle communication RoS:用于道路与车辆通信的被动智能表面
Pub Date : 2021-08-09 DOI: 10.1145/3452296.3472896
John Nolan, Kun Qian, Xinyu Zhang
Modern autonomous vehicles are commonly instrumented with radars for all-weather perception. Yet the radar functionality is limited to identifying the positions of reflectors in the environment. In this paper, we investigate the feasibility of smartening transportation infrastructure for the purpose of conveying richer information to automotive radars. We propose RoS, a passive PCB-fabricated smart surface which can be reconfigured to embed digital bits, and inform the radar much like visual road signs do to cameras. We design the RoS signage to act as a retrodirective reflector which can reflect signals back to the radar from wide viewing angles. We further introduce a spatial encoding scheme, which piggybacks information in the reflected analog signals based on the geometrical layout of the retroreflective elements. Our prototype fabrication and experimentation verifies the effectiveness of RoS as an RF ''barcode'' which is readable by radar in practical transportation environment.
现代自动驾驶汽车通常配备雷达,以实现全天候感知。然而,雷达的功能仅限于识别环境中反射器的位置。在本文中,我们研究了智能交通基础设施的可行性,目的是向汽车雷达传递更丰富的信息。我们提出了RoS,这是一种被动pcb制造的智能表面,可以重新配置以嵌入数字位,并像视觉道路标志对摄像机一样通知雷达。我们设计的RoS标识作为一个反向反射器,可以从宽视角将信号反射回雷达。我们进一步介绍了一种空间编码方案,该方案基于反射元件的几何布局来承载反射模拟信号中的信息。我们的原型制作和实验验证了RoS作为射频“条形码”的有效性,该条形码可在实际运输环境中被雷达读取。
{"title":"RoS: passive smart surface for roadside-to-vehicle communication","authors":"John Nolan, Kun Qian, Xinyu Zhang","doi":"10.1145/3452296.3472896","DOIUrl":"https://doi.org/10.1145/3452296.3472896","url":null,"abstract":"Modern autonomous vehicles are commonly instrumented with radars for all-weather perception. Yet the radar functionality is limited to identifying the positions of reflectors in the environment. In this paper, we investigate the feasibility of smartening transportation infrastructure for the purpose of conveying richer information to automotive radars. We propose RoS, a passive PCB-fabricated smart surface which can be reconfigured to embed digital bits, and inform the radar much like visual road signs do to cameras. We design the RoS signage to act as a retrodirective reflector which can reflect signals back to the radar from wide viewing angles. We further introduce a spatial encoding scheme, which piggybacks information in the reflected analog signals based on the geometrical layout of the retroreflective elements. Our prototype fabrication and experimentation verifies the effectiveness of RoS as an RF ''barcode'' which is readable by radar in practical transportation environment.","PeriodicalId":20487,"journal":{"name":"Proceedings of the 2021 ACM SIGCOMM 2021 Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88741555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
BlueFi
Pub Date : 2021-08-09 DOI: 10.1145/3452296.3472920
Hsun-Wei Cho, K. Shin
Bluetooth and WiFi are the two dominant technologies enabling the communication of mobile and IoT devices. Built with specific design goals and principles, they are vastly different, each using its own hardware and software. Thus, they are not interoperable and require different hardware. One may, therefore, ask a simple, yet seemingly impossible question: “Can we transmit Bluetooth packets on commercial off-the-shelf (COTS) WiFi hardware?” We answer this question positively by designing, implementing and demonstrating a novel system called BlueFi. It can readily run on existing, widely-deployed WiFi devices without modifying NIC firmware/hardware. BlueFi works by reversing the signal processing of WiFi hardware and finds special 802.11n packets that are decodable by unmodified Bluetooth devices. With BlueFi, every 802.11n device can be used simultaneously as a Bluetooth device, which instantly increases the coverage of Bluetooth, thanks to the omnipresence of WiFi devices. BlueFi is particularly useful for WiFi-only devices or environments. We implement and evaluate BlueFi on devices with widely-adopted WiFi chips. We also construct two prevalent end-to-end apps — Bluetooth beacon and audio — to showcase the practical use of BlueFi. The former allows ordinary APs to send location beacons; the latter enables WiFi chips to stream Bluetooth audio in real time.
{"title":"BlueFi","authors":"Hsun-Wei Cho, K. Shin","doi":"10.1145/3452296.3472920","DOIUrl":"https://doi.org/10.1145/3452296.3472920","url":null,"abstract":"Bluetooth and WiFi are the two dominant technologies enabling the communication of mobile and IoT devices. Built with specific design goals and principles, they are vastly different, each using its own hardware and software. Thus, they are not interoperable and require different hardware. One may, therefore, ask a simple, yet seemingly impossible question: “Can we transmit Bluetooth packets on commercial off-the-shelf (COTS) WiFi hardware?” We answer this question positively by designing, implementing and demonstrating a novel system called BlueFi. It can readily run on existing, widely-deployed WiFi devices without modifying NIC firmware/hardware. BlueFi works by reversing the signal processing of WiFi hardware and finds special 802.11n packets that are decodable by unmodified Bluetooth devices. With BlueFi, every 802.11n device can be used simultaneously as a Bluetooth device, which instantly increases the coverage of Bluetooth, thanks to the omnipresence of WiFi devices. BlueFi is particularly useful for WiFi-only devices or environments. We implement and evaluate BlueFi on devices with widely-adopted WiFi chips. We also construct two prevalent end-to-end apps — Bluetooth beacon and audio — to showcase the practical use of BlueFi. The former allows ordinary APs to send location beacons; the latter enables WiFi chips to stream Bluetooth audio in real time.","PeriodicalId":20487,"journal":{"name":"Proceedings of the 2021 ACM SIGCOMM 2021 Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74332904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Anycast In context: a tale of two systems 上下文中:两个系统的故事
Pub Date : 2021-08-09 DOI: 10.1145/3452296.3472891
T. Koch, Ethan Katz-Bassett, J. Heidemann, Matt Calder, Calvin Ardi, Ke Li
Anycast is used to serve content including web pages and DNS, and anycast deployments are growing. However, prior work examining root DNS suggests anycast deployments incur significant inflation, with users often routed to suboptimal sites. We reassess anycast performance, first extending prior analysis on inflation in the root DNS. We show that inflation is very common in root DNS, affecting more than 95% of users. However, we then show root DNS latency emph{hardly matters} to users because caching is so effective. These findings lead us to question: is inflation inherent to anycast, or can inflation be limited when it matters? To answer this question, we consider Microsoft's anycast CDN serving latency-sensitive content. Here, latency matters orders of magnitude more than for root DNS. Perhaps because of this need, only 35% of CDN users experience any inflation, and the amount they experience is smaller than root DNS. We show that CDN anycast latency has little inflation due to extensive peering and engineering. These results suggest prior claims of anycast inefficiency reflect experiments on a single application rather than anycast's technical potential, and they demonstrate the importance of context when measuring system performance.
任意播用于提供包括网页和DNS在内的内容,并且任意播的部署正在增长。然而,先前对根DNS的研究表明,任意播部署会导致严重的膨胀,用户经常被路由到次优站点。我们重新评估任播性能,首先扩展先前对根DNS膨胀的分析。我们发现膨胀在根DNS中非常常见,影响超过95%的用户。然而,我们随后展示了根DNS延迟对用户emph{来说几乎无关紧要},因为缓存非常有效。这些发现让我们产生疑问:通货膨胀是任何预测都固有的,还是在重要的时候通货膨胀是有限的?为了回答这个问题,我们考虑微软的任播CDN服务延迟敏感内容。在这里,延迟比根DNS更重要。也许是因为这种需求,只有35%的CDN用户经历过通货膨胀,而且他们经历的通货膨胀比根DNS要小。我们表明,由于广泛的对等和工程,CDN任播延迟几乎没有膨胀。这些结果表明,先前声称anycast效率低下反映的是对单个应用程序的实验,而不是anycast的技术潜力,并且它们表明了在测量系统性能时上下文的重要性。
{"title":"Anycast In context: a tale of two systems","authors":"T. Koch, Ethan Katz-Bassett, J. Heidemann, Matt Calder, Calvin Ardi, Ke Li","doi":"10.1145/3452296.3472891","DOIUrl":"https://doi.org/10.1145/3452296.3472891","url":null,"abstract":"Anycast is used to serve content including web pages and DNS, and anycast deployments are growing. However, prior work examining root DNS suggests anycast deployments incur significant inflation, with users often routed to suboptimal sites. We reassess anycast performance, first extending prior analysis on inflation in the root DNS. We show that inflation is very common in root DNS, affecting more than 95% of users. However, we then show root DNS latency emph{hardly matters} to users because caching is so effective. These findings lead us to question: is inflation inherent to anycast, or can inflation be limited when it matters? To answer this question, we consider Microsoft's anycast CDN serving latency-sensitive content. Here, latency matters orders of magnitude more than for root DNS. Perhaps because of this need, only 35% of CDN users experience any inflation, and the amount they experience is smaller than root DNS. We show that CDN anycast latency has little inflation due to extensive peering and engineering. These results suggest prior claims of anycast inefficiency reflect experiments on a single application rather than anycast's technical potential, and they demonstrate the importance of context when measuring system performance.","PeriodicalId":20487,"journal":{"name":"Proceedings of the 2021 ACM SIGCOMM 2021 Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91169806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
Programmable packet scheduling with a single queue 单队列可编程数据包调度
Pub Date : 2021-08-09 DOI: 10.1145/3452296.3472887
Zhuolong Yu, Chuheng Hu, Jingfeng Wu, Xiao Sun, V. Braverman, Mosharaf Chowdhury, Zhenhua Liu, Xin Jin
Programmable packet scheduling enables scheduling algorithms to be programmed into the data plane without changing the hardware. Existing proposals either have no hardware implementations for switch ASICs or require multiple strict-priority queues. We present Admission-In First-Out (AIFO) queues, a new solution for programmable packet scheduling that uses only a emph{single} first-in first-out queue. AIFO is motivated by the confluence of two recent trends: emph{shallow} buffers in switches and emph{fast-converging} congestion control in end hosts, that together leads to a simple observation: the decisive factor in a flow's completion time (FCT) in modern datacenter networks is often emph{which} packets are enqueued or dropped, not the emph{ordering} they leave the switch. The core idea of AIFO is to maintain a sliding window to track the ranks of recent packets and compute the relative rank of an arriving packet in the window for admission control. Theoretically, we prove that AIFO provides bounded performance to Push-In First-Out (PIFO). Empirically, we fully implement AIFO and evaluate AIFO with a range of real workloads, demonstrating AIFO closely approximates PIFO. Importantly, unlike PIFO, AIFO can run at line rate on existing hardware and use minimal switch resources---as few as a single queue.
可编程包调度使调度算法可以在不改变硬件的情况下被编程到数据平面。现有的建议要么没有交换机asic的硬件实现,要么需要多个严格优先级队列。本文提出了一种新的可编程分组调度方案——AIFO队列,该方案仅使用emph{单个}先入先出队列。AIFO是由最近两种趋势的融合而产生的:交换机的emph{浅}缓冲和终端主机的emph{快速收敛}拥塞控制,这两种趋势共同导致了一个简单的观察:在现代数据中心网络中,流完成时间(FCT)的决定性因素通常是emph{哪些}数据包被排队或丢弃,而不是它们离开交换机的emph{顺序}。AIFO的核心思想是保持一个滑动窗口来跟踪最近数据包的排名,并计算在窗口中到达数据包的相对排名以进行准入控制。理论上,我们证明了AIFO提供了有限的性能推入先出(PIFO)。在经验上,我们完全实现了AIFO,并使用一系列实际工作负载对AIFO进行了评估,证明AIFO非常接近PIFO。重要的是,与PIFO不同,AIFO可以在现有硬件上以行速率运行,并且使用最小的交换机资源——少到一个队列。
{"title":"Programmable packet scheduling with a single queue","authors":"Zhuolong Yu, Chuheng Hu, Jingfeng Wu, Xiao Sun, V. Braverman, Mosharaf Chowdhury, Zhenhua Liu, Xin Jin","doi":"10.1145/3452296.3472887","DOIUrl":"https://doi.org/10.1145/3452296.3472887","url":null,"abstract":"Programmable packet scheduling enables scheduling algorithms to be programmed into the data plane without changing the hardware. Existing proposals either have no hardware implementations for switch ASICs or require multiple strict-priority queues. We present Admission-In First-Out (AIFO) queues, a new solution for programmable packet scheduling that uses only a emph{single} first-in first-out queue. AIFO is motivated by the confluence of two recent trends: emph{shallow} buffers in switches and emph{fast-converging} congestion control in end hosts, that together leads to a simple observation: the decisive factor in a flow's completion time (FCT) in modern datacenter networks is often emph{which} packets are enqueued or dropped, not the emph{ordering} they leave the switch. The core idea of AIFO is to maintain a sliding window to track the ranks of recent packets and compute the relative rank of an arriving packet in the window for admission control. Theoretically, we prove that AIFO provides bounded performance to Push-In First-Out (PIFO). Empirically, we fully implement AIFO and evaluate AIFO with a range of real workloads, demonstrating AIFO closely approximates PIFO. Importantly, unlike PIFO, AIFO can run at line rate on existing hardware and use minimal switch resources---as few as a single queue.","PeriodicalId":20487,"journal":{"name":"Proceedings of the 2021 ACM SIGCOMM 2021 Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83933265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 45
Prognosis: closed-box analysis of network protocol implementations 预测:网络协议实现的闭箱分析
Pub Date : 2021-08-09 DOI: 10.1145/3452296.3472938
Tiago Ferreira, Harrison Brewton, Loris D'antoni, Alexandra Silva
We present Prognosis, a framework offering automated closed-box learning and analysis of models of network protocol implementations. Prognosis can learn models that vary in abstraction level from simple deterministic automata to models containing data operations, such as register updates, and can be used to unlock a variety of analysis techniques -- model checking temporal properties, computing differences between models of two implementations of the same protocol, or improving testing via model-based test generation. Prognosis is modular and easily adaptable to different protocols (e.g. TCP and QUIC) and their implementations. We use Prognosis to learn models of (parts of) three QUIC implementations -- Quiche (Cloudflare), Google QUIC, and Facebook mvfst -- and use these models to analyse the differences between the various implementations. Our analysis provides insights into different design choices and uncovers potential bugs. Concretely, we have found critical bugs in multiple QUIC implementations, which have been acknowledged by the developers.
我们提出了一个提供自动封闭盒学习和网络协议实现模型分析的框架。预后可以学习在抽象级别上变化的模型,从简单的确定性自动机到包含数据操作的模型,例如寄存器更新,并且可以用于解锁各种分析技术——模型检查时间属性,计算相同协议的两个实现的模型之间的差异,或者通过基于模型的测试生成来改进测试。预后是模块化的,很容易适应不同的协议(如TCP和QUIC)及其实现。我们使用预后来学习三种QUIC实现的模型(部分)——Quiche (Cloudflare)、Google QUIC和Facebook mvfst——并使用这些模型来分析各种实现之间的差异。我们的分析提供了对不同设计选择的见解,并揭示了潜在的缺陷。具体地说,我们在多个QUIC实现中发现了严重的错误,这些错误已经被开发人员承认。
{"title":"Prognosis: closed-box analysis of network protocol implementations","authors":"Tiago Ferreira, Harrison Brewton, Loris D'antoni, Alexandra Silva","doi":"10.1145/3452296.3472938","DOIUrl":"https://doi.org/10.1145/3452296.3472938","url":null,"abstract":"We present Prognosis, a framework offering automated closed-box learning and analysis of models of network protocol implementations. Prognosis can learn models that vary in abstraction level from simple deterministic automata to models containing data operations, such as register updates, and can be used to unlock a variety of analysis techniques -- model checking temporal properties, computing differences between models of two implementations of the same protocol, or improving testing via model-based test generation. Prognosis is modular and easily adaptable to different protocols (e.g. TCP and QUIC) and their implementations. We use Prognosis to learn models of (parts of) three QUIC implementations -- Quiche (Cloudflare), Google QUIC, and Facebook mvfst -- and use these models to analyse the differences between the various implementations. Our analysis provides insights into different design choices and uncovers potential bugs. Concretely, we have found critical bugs in multiple QUIC implementations, which have been acknowledged by the developers.","PeriodicalId":20487,"journal":{"name":"Proceedings of the 2021 ACM SIGCOMM 2021 Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76580883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
A throughput-centric view of the performance of datacenter topologies 以吞吐量为中心的数据中心拓扑性能视图
Pub Date : 2021-08-09 DOI: 10.1145/3452296.3472913
Pooria Namyar, Sucha Supittayapornpong, Mingyang Zhang, Minlan Yu, R. Govindan
While prior work has explored many proposed datacenter designs, only two designs, Clos-based and expander-based, are generally considered practical because they can scale using commodity switching chips. Prior work has used two different metrics, bisection bandwidth and throughput, for evaluating these topologies at scale. Little is known, theoretically or practically, how these metrics relate to each other. Exploiting characteristics of these topologies, we prove an upper bound on their throughput, then show that this upper bound better estimates worst-case throughput than all previously proposed throughput estimators and scales better than most of them. Using this upper bound, we show that for expander-based topologies, unlike Clos, beyond a certain size of the network, no topology can have full throughput, even if it has full bisection bandwidth; in fact, even relatively small expander-based topologies fail to achieve full throughput. We conclude by showing that using throughput to evaluate datacenter performance instead of bisection bandwidth can alter conclusions in prior work about datacenter cost, manageability, and reliability.
虽然以前的工作已经探索了许多提出的数据中心设计,但只有两种设计,基于clos和基于扩展器的设计,通常被认为是实用的,因为它们可以使用商用交换芯片进行扩展。先前的工作使用了两种不同的度量,二分带宽和吞吐量,用于大规模评估这些拓扑结构。无论从理论上还是在实践中,我们都不知道这些指标是如何相互关联的。利用这些拓扑的特点,我们证明了它们的吞吐量的上界,然后表明这个上界比所有以前提出的吞吐量估计器更好地估计最坏情况吞吐量,并且比大多数它们更好地扩展。使用这个上界,我们表明,对于基于扩展器的拓扑,与Clos不同,超过一定的网络大小,没有拓扑可以具有完全的吞吐量,即使它具有完全的二分带宽;事实上,即使是相对较小的基于扩展器的拓扑也无法实现完全吞吐量。我们的结论是,使用吞吐量来评估数据中心性能而不是平分带宽可以改变先前关于数据中心成本、可管理性和可靠性的结论。
{"title":"A throughput-centric view of the performance of datacenter topologies","authors":"Pooria Namyar, Sucha Supittayapornpong, Mingyang Zhang, Minlan Yu, R. Govindan","doi":"10.1145/3452296.3472913","DOIUrl":"https://doi.org/10.1145/3452296.3472913","url":null,"abstract":"While prior work has explored many proposed datacenter designs, only two designs, Clos-based and expander-based, are generally considered practical because they can scale using commodity switching chips. Prior work has used two different metrics, bisection bandwidth and throughput, for evaluating these topologies at scale. Little is known, theoretically or practically, how these metrics relate to each other. Exploiting characteristics of these topologies, we prove an upper bound on their throughput, then show that this upper bound better estimates worst-case throughput than all previously proposed throughput estimators and scales better than most of them. Using this upper bound, we show that for expander-based topologies, unlike Clos, beyond a certain size of the network, no topology can have full throughput, even if it has full bisection bandwidth; in fact, even relatively small expander-based topologies fail to achieve full throughput. We conclude by showing that using throughput to evaluate datacenter performance instead of bisection bandwidth can alter conclusions in prior work about datacenter cost, manageability, and reliability.","PeriodicalId":20487,"journal":{"name":"Proceedings of the 2021 ACM SIGCOMM 2021 Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80954468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
L2D2
Pub Date : 2021-08-09 DOI: 10.1145/3452296.3472932
Deepak Vasisht, Jayanth Shenoy, Ranveer Chandra
Large constellations of Low Earth Orbit satellites promise to provide near real-time high-resolution Earth imagery. Yet, getting this large amount of data back to Earth is challenging because of their low orbits and fast motion through space. Centralized architectures with few multi-million dollar ground stations incur large hour-level data download latency and are hard to scale. We propose a geographically distributed ground station design, L2D2, that uses low-cost commodity hardware to offer low latency robust downlink. L2D2 is the first system to use a hybrid ground station model, where only a subset of ground stations are uplink-capable. We design new algorithms for scheduling and rate adaptation that enable low latency and high robustness despite the limitations of the receive-only ground stations. We evaluate L2D2 through a combination of trace-driven simulations and real-world satellite-ground station measurements. Our results demonstrate that L2D2's geographically distributed design can reduce data downlink latency from 90 minutes to 21 minutes.
{"title":"L2D2","authors":"Deepak Vasisht, Jayanth Shenoy, Ranveer Chandra","doi":"10.1145/3452296.3472932","DOIUrl":"https://doi.org/10.1145/3452296.3472932","url":null,"abstract":"Large constellations of Low Earth Orbit satellites promise to provide near real-time high-resolution Earth imagery. Yet, getting this large amount of data back to Earth is challenging because of their low orbits and fast motion through space. Centralized architectures with few multi-million dollar ground stations incur large hour-level data download latency and are hard to scale. We propose a geographically distributed ground station design, L2D2, that uses low-cost commodity hardware to offer low latency robust downlink. L2D2 is the first system to use a hybrid ground station model, where only a subset of ground stations are uplink-capable. We design new algorithms for scheduling and rate adaptation that enable low latency and high robustness despite the limitations of the receive-only ground stations. We evaluate L2D2 through a combination of trace-driven simulations and real-world satellite-ground station measurements. Our results demonstrate that L2D2's geographically distributed design can reduce data downlink latency from 90 minutes to 21 minutes.","PeriodicalId":20487,"journal":{"name":"Proceedings of the 2021 ACM SIGCOMM 2021 Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78870414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 40
RedPlane: enabling fault-tolerant stateful in-switch applications RedPlane:启用容错的交换机内状态应用
Pub Date : 2021-08-09 DOI: 10.1145/3452296.3472905
Daehyeok Kim, J. Nelson, Dan R. K. Ports, V. Sekar, S. Seshan
Many recent efforts have demonstrated the performance benefits of running datacenter functions (emph{e.g.,} NATs, load balancers, monitoring) on programmable switches. However, a key missing piece remains: fault tolerance. This is especially critical as the network is no longer stateless and pure endpoint recovery does not suffice. In this paper, we design and implement RedPlane, a fault-tolerant state store for stateful in-switch applications. This provides in-switch applications consistent access to their state, even if the switch they run on fails or traffic is rerouted to an alternative switch. We address key challenges in devising a practical, provably correct replication protocol and implementing it in the switch data plane. Our evaluations show that RedPlane incurs negligible overhead and enables end-to-end applications to rapidly recover from switch failures.
最近的许多工作已经证明了在可编程交换机上运行数据中心功能(emph{例如},nat、负载平衡器、监控)的性能优势。但是,仍然缺少一个关键部分:容错。当网络不再是无状态的,纯端点恢复是不够的,这一点尤其重要。在本文中,我们设计并实现了RedPlane,一个容错状态存储,用于有状态交换应用程序。这为交换机内应用程序提供了对其状态的一致访问,即使它们运行的交换机发生故障或流量被重路由到备用交换机。我们解决了设计一个实用的、可证明正确的复制协议并在交换机数据平面上实现它的关键挑战。我们的评估表明,RedPlane产生的开销可以忽略不计,并使端到端应用程序能够从交换机故障中快速恢复。
{"title":"RedPlane: enabling fault-tolerant stateful in-switch applications","authors":"Daehyeok Kim, J. Nelson, Dan R. K. Ports, V. Sekar, S. Seshan","doi":"10.1145/3452296.3472905","DOIUrl":"https://doi.org/10.1145/3452296.3472905","url":null,"abstract":"Many recent efforts have demonstrated the performance benefits of running datacenter functions (emph{e.g.,} NATs, load balancers, monitoring) on programmable switches. However, a key missing piece remains: fault tolerance. This is especially critical as the network is no longer stateless and pure endpoint recovery does not suffice. In this paper, we design and implement RedPlane, a fault-tolerant state store for stateful in-switch applications. This provides in-switch applications consistent access to their state, even if the switch they run on fails or traffic is rerouted to an alternative switch. We address key challenges in devising a practical, provably correct replication protocol and implementing it in the switch data plane. Our evaluations show that RedPlane incurs negligible overhead and enables end-to-end applications to rapidly recover from switch failures.","PeriodicalId":20487,"journal":{"name":"Proceedings of the 2021 ACM SIGCOMM 2021 Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77790052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Insights from operating an IP exchange provider 运营IP交换提供商的见解
Pub Date : 2021-08-09 DOI: 10.1145/3452296.3472930
Andra Lutu, Diego Perino, M. Bagnulo, F. Bustamante
IP Exchange Providers (IPX-Ps) offer to their customers (e.g., mobile or IoT service providers) global data roaming and support for a variety of emerging services. They peer to other IPX-Ps and form the IPX network, which interconnects 800 MNOs worldwide offering their customers access to mobile services in any other country. Despite the importance of IPX-Ps, little is known about their operations and performance. In this paper, we shed light on these opaque providers by analyzing a large IPX-P with more than 100 PoPs in 40+ countries, with a particularly strong presence in America and Europe. Specifically, we characterize the traffic and performance of the main infrastructures of the IPX-P (i.e., 2-3-4G signaling and GTP tunneling), and provide implications for its operation, as well as for the IPX-P's customers. Our analysis is based on statistics we collected during two time periods (i.e., prior and during COVID-19 pandemic) and includes insights on the main service the platform supports (i.e., IoT and data roaming), traffic breakdown and geographical/temporal distribution, communication performance (e.g., tunnel setup time, RTTs). Our results constitute a step towards advancing the understanding of IPX-Ps at their core, and provide guidelines for their operations and customer satisfaction.
IP交换提供商(ipx - p)为其客户(例如移动或物联网服务提供商)提供全球数据漫游和对各种新兴服务的支持。它们与其他IPX- p对等并形成IPX网络,该网络将全球800个mno互连起来,为其客户提供在任何其他国家的移动服务。尽管ipx - p很重要,但人们对它们的操作和性能知之甚少。在本文中,我们通过分析一个大型IPX-P来揭示这些不透明的供应商,该IPX-P在40多个国家拥有100多个pop,在美国和欧洲尤其强大。具体来说,我们描述了IPX-P主要基础设施(即2-3-4G信令和GTP隧道)的流量和性能,并提供了其操作以及IPX-P客户的影响。我们的分析基于我们在两个时间段(即COVID-19大流行之前和期间)收集的统计数据,并包括对平台支持的主要服务(即物联网和数据漫游)、流量分解和地理/时间分布、通信性能(例如隧道建立时间、rtt)的见解。我们的研究结果是推动对IPX-Ps核心理解的一步,并为其运营和客户满意度提供指导。
{"title":"Insights from operating an IP exchange provider","authors":"Andra Lutu, Diego Perino, M. Bagnulo, F. Bustamante","doi":"10.1145/3452296.3472930","DOIUrl":"https://doi.org/10.1145/3452296.3472930","url":null,"abstract":"IP Exchange Providers (IPX-Ps) offer to their customers (e.g., mobile or IoT service providers) global data roaming and support for a variety of emerging services. They peer to other IPX-Ps and form the IPX network, which interconnects 800 MNOs worldwide offering their customers access to mobile services in any other country. Despite the importance of IPX-Ps, little is known about their operations and performance. In this paper, we shed light on these opaque providers by analyzing a large IPX-P with more than 100 PoPs in 40+ countries, with a particularly strong presence in America and Europe. Specifically, we characterize the traffic and performance of the main infrastructures of the IPX-P (i.e., 2-3-4G signaling and GTP tunneling), and provide implications for its operation, as well as for the IPX-P's customers. Our analysis is based on statistics we collected during two time periods (i.e., prior and during COVID-19 pandemic) and includes insights on the main service the platform supports (i.e., IoT and data roaming), traffic breakdown and geographical/temporal distribution, communication performance (e.g., tunnel setup time, RTTs). Our results constitute a step towards advancing the understanding of IPX-Ps at their core, and provide guidelines for their operations and customer satisfaction.","PeriodicalId":20487,"journal":{"name":"Proceedings of the 2021 ACM SIGCOMM 2021 Conference","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73961434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2021 ACM SIGCOMM 2021 Conference
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1