首页 > 最新文献

Proceedings of the 7th Asia-Pacific Workshop on Networking最新文献

英文 中文
Scaling Switch-driven Flow Control with Aquarius 缩放开关驱动的流量控制水瓶座
Pub Date : 2023-06-29 DOI: 10.1145/3600061.3600066
Wenxue Li, Chaoliang Zeng, Jinbin Hu, Kai Chen
As datacenter networks support more diverse applications and faster link speeds, effective end-to-end congestion control becomes increasingly challenging due to the inherent feedback delay. To address this issue, switch-driven per-hop flow control (FC) has gained popularity due to its natural flow isolation, timely control loop, and ability to handle transient congestion. However, the ideal FC requires impractical hardware resources, and the state-of-the-art approximation approach still demands a large number of queues that exceeds common switch capabilities, limiting scalability in practice. In this paper, we propose Aquarius, a scalable solution for per-hop FC that maintains satisfactory flow isolation with a practical number of queues. The key idea of Aquarius is to take independent control of different flows within the same queue, discarding the traditional practice of managing traffic collectively within the same queue. At its core, Aquarius applies a contribution-aware pausing mechanism on congested switches to enable individual control decisions for arriving flows, and uses an opportunistic re-assigning strategy on upstream switches to further isolate congested and victim flows. Experimental results demonstrate that Aquarius maintains comparable performance with 4 × fewer queues, and achieves 5.5 × lower flow completion times using the same number of queues, compared to existing solutions.
随着数据中心网络支持更多样化的应用程序和更快的链路速度,由于固有的反馈延迟,有效的端到端拥塞控制变得越来越具有挑战性。为了解决这个问题,交换机驱动的每跳流控制(FC)由于其自然的流隔离、及时的控制回路和处理瞬态拥塞的能力而得到了普及。然而,理想的FC需要不切实际的硬件资源,而最先进的近似方法仍然需要大量的队列,这超出了普通交换机的能力,限制了实践中的可伸缩性。在本文中,我们提出了Aquarius,这是一种可扩展的每跳FC解决方案,它可以在实际数量的队列中保持令人满意的流隔离。Aquarius的关键思想是对同一队列内的不同流量进行独立控制,抛弃了在同一队列内集体管理流量的传统做法。其核心是,Aquarius在拥塞交换机上应用了贡献感知暂停机制,以实现对到达流的单独控制决策,并在上游交换机上使用机会重新分配策略,以进一步隔离拥塞流和受害流。实验结果表明,与现有解决方案相比,Aquarius在减少4倍队列的情况下保持了相当的性能,并在使用相同数量队列的情况下实现了5.5倍的流程完成时间。
{"title":"Scaling Switch-driven Flow Control with Aquarius","authors":"Wenxue Li, Chaoliang Zeng, Jinbin Hu, Kai Chen","doi":"10.1145/3600061.3600066","DOIUrl":"https://doi.org/10.1145/3600061.3600066","url":null,"abstract":"As datacenter networks support more diverse applications and faster link speeds, effective end-to-end congestion control becomes increasingly challenging due to the inherent feedback delay. To address this issue, switch-driven per-hop flow control (FC) has gained popularity due to its natural flow isolation, timely control loop, and ability to handle transient congestion. However, the ideal FC requires impractical hardware resources, and the state-of-the-art approximation approach still demands a large number of queues that exceeds common switch capabilities, limiting scalability in practice. In this paper, we propose Aquarius, a scalable solution for per-hop FC that maintains satisfactory flow isolation with a practical number of queues. The key idea of Aquarius is to take independent control of different flows within the same queue, discarding the traditional practice of managing traffic collectively within the same queue. At its core, Aquarius applies a contribution-aware pausing mechanism on congested switches to enable individual control decisions for arriving flows, and uses an opportunistic re-assigning strategy on upstream switches to further isolate congested and victim flows. Experimental results demonstrate that Aquarius maintains comparable performance with 4 × fewer queues, and achieves 5.5 × lower flow completion times using the same number of queues, compared to existing solutions.","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124825840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Improved Data Aggregation Scheme for Wireless Sensor Networks Based on Robust Chinese Remainder Theorem 一种改进的基于鲁棒中国剩余定理的无线传感器网络数据聚合方案
Pub Date : 2023-06-29 DOI: 10.1145/3600061.3603176
Jinxin Zhang, Fuyou Miao
In wireless sensor networks (WSNs), energy efficiency, reliability, and non-plaintext transmission of the sensed data are major concerns, and all three of them are indispensable. Based on robust Chinese Remainder Theorem (RCRT), this paper proposes an improved data aggregation scheme to satisfy the requirements of energy efficiency, reliability, and non-plaintext transmission simultaneously. Compared with the existing RCRT-based data aggregation scheme, our improved RCRT-based data aggregation scheme tolerates an unrestricted error at the expense of certain energy saving.
在无线传感器网络(WSNs)中,能源效率、可靠性和传感数据的非明文传输是主要关注的问题,三者缺一不可少。基于鲁棒中国剩余定理(RCRT),提出了一种改进的数据聚合方案,以同时满足能源效率、可靠性和非明文传输的要求。与现有的基于rcrt的数据聚合方案相比,改进的基于rcrt的数据聚合方案以一定的节能为代价,允许不受限制的误差。
{"title":"An Improved Data Aggregation Scheme for Wireless Sensor Networks Based on Robust Chinese Remainder Theorem","authors":"Jinxin Zhang, Fuyou Miao","doi":"10.1145/3600061.3603176","DOIUrl":"https://doi.org/10.1145/3600061.3603176","url":null,"abstract":"In wireless sensor networks (WSNs), energy efficiency, reliability, and non-plaintext transmission of the sensed data are major concerns, and all three of them are indispensable. Based on robust Chinese Remainder Theorem (RCRT), this paper proposes an improved data aggregation scheme to satisfy the requirements of energy efficiency, reliability, and non-plaintext transmission simultaneously. Compared with the existing RCRT-based data aggregation scheme, our improved RCRT-based data aggregation scheme tolerates an unrestricted error at the expense of certain energy saving.","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132700195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improving Cloud Storage Network Bandwidth Utilization of Scientific Applications 提高云存储网络带宽利用率的科学应用
Pub Date : 2023-06-29 DOI: 10.1145/3600061.3603122
Steven W. D. Chien, Kento Sato, Artur Podobas, Niclas Jansson, S. Markidis, Michio Honda
Cloud providers began to provide managed services to attract scientific applications, which have been traditionally executed on supercomputers. One example is AWS FSx for Lustre, a fully managed parallel file system (PFS) released in 2018. However, due to the nature of scientific applications, the frontend storage network bandwidth is left completely idle for the majority of its lifetime. Furthermore, the pricing model does not match the scalability requirement. We propose iFast, a novel host-side caching mechanism for scientific applications that improves storage bandwidth utilization and end-to-end application performance: by overlapping compute and data writeback through inexpensive local storage. iFast supports the Massage Passing Interface (MPI) library that is widely used by scientific applications and is implemented as a preloaded library. It requires no change to applications, the MPI library, or support from cloud operators. We demonstrate how iFast can accelerate the end-to-end time of a representative scientific application Neko, by 13–40%.
云提供商开始提供托管服务,以吸引传统上在超级计算机上执行的科学应用程序。其中一个例子是2018年发布的完全托管并行文件系统(PFS) AWS FSx for Lustre。然而,由于科学应用的性质,前端存储网络带宽在其生命周期的大部分时间内是完全空闲的。此外,定价模型与可伸缩性需求不匹配。我们提出了iFast,一种用于科学应用的新型主机端缓存机制,它通过廉价的本地存储重叠计算和数据回写,提高了存储带宽利用率和端到端应用性能。iFast支持在科学应用中广泛使用的按摩传递接口(MPI)库,并作为预加载库实现。它不需要更改应用程序、MPI库或云运营商的支持。我们演示了iFast如何将具有代表性的科学应用程序Neko的端到端时间缩短13-40%。
{"title":"Improving Cloud Storage Network Bandwidth Utilization of Scientific Applications","authors":"Steven W. D. Chien, Kento Sato, Artur Podobas, Niclas Jansson, S. Markidis, Michio Honda","doi":"10.1145/3600061.3603122","DOIUrl":"https://doi.org/10.1145/3600061.3603122","url":null,"abstract":"Cloud providers began to provide managed services to attract scientific applications, which have been traditionally executed on supercomputers. One example is AWS FSx for Lustre, a fully managed parallel file system (PFS) released in 2018. However, due to the nature of scientific applications, the frontend storage network bandwidth is left completely idle for the majority of its lifetime. Furthermore, the pricing model does not match the scalability requirement. We propose iFast, a novel host-side caching mechanism for scientific applications that improves storage bandwidth utilization and end-to-end application performance: by overlapping compute and data writeback through inexpensive local storage. iFast supports the Massage Passing Interface (MPI) library that is widely used by scientific applications and is implemented as a preloaded library. It requires no change to applications, the MPI library, or support from cloud operators. We demonstrate how iFast can accelerate the end-to-end time of a representative scientific application Neko, by 13–40%.","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117141715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SegaNet: An Advanced IoT Cloud Gateway for Performant and Priority-Oriented Message Delivery SegaNet:一种高级物联网云网关,用于高性能和面向优先级的消息传递
Pub Date : 2023-06-29 DOI: 10.1145/3600061.3600072
Yeonho Yoo, Zhixiong Niu, C. Yoo, Peng Cheng, Y. Xiong
With the tremendous growth of IoT, the role of IoT cloud gateways in facilitating communication between IoT devices and the cloud has become more important than ever before. Most previous studies have focused on developing interoperability between IoT and cloud to accommodate various radio protocols. However, they have often neglected the performance aspect of the IoT cloud gateway, leaving users with limited options: either purchasing multiple gateways or connecting only a small number of IoT devices. Through our comprehensive measurements and analysis, we identified five key issues in IoT cloud gateways related to high latency, CPU bottlenecks, inefficient network stacks on ARM, substantial encryption overhead, and the lack of priority support. To address these issues, we propose a new IoT cloud gateway - SegaNet. We carefully design with 1) multiple agents management, 2) efficient TLS encryption, and 3) priority-oriented message delivery. Our prototype evaluation shows up to 16.7 × lower latency and 4.5 × lower CPU consumption than gateways of the existing IoT-cloud ecosystem.
随着物联网的迅猛发展,物联网云网关在促进物联网设备与云之间通信方面的作用变得比以往任何时候都更加重要。以前的大多数研究都集中在开发物联网和云之间的互操作性,以适应各种无线电协议。然而,他们往往忽略了物联网云网关的性能方面,给用户留下了有限的选择:要么购买多个网关,要么只连接少量物联网设备。通过全面的测量和分析,我们确定了物联网云网关中的五个关键问题,这些问题与高延迟、CPU瓶颈、ARM上低效的网络堆栈、大量的加密开销以及缺乏优先级支持有关。为了解决这些问题,我们提出了一种新的物联网云网关——SegaNet。我们精心设计了1)多代理管理,2)高效的TLS加密,以及3)面向优先级的消息传递。我们的原型评估显示,与现有物联网云生态系统的网关相比,延迟降低了16.7倍,CPU消耗降低了4.5倍。
{"title":"SegaNet: An Advanced IoT Cloud Gateway for Performant and Priority-Oriented Message Delivery","authors":"Yeonho Yoo, Zhixiong Niu, C. Yoo, Peng Cheng, Y. Xiong","doi":"10.1145/3600061.3600072","DOIUrl":"https://doi.org/10.1145/3600061.3600072","url":null,"abstract":"With the tremendous growth of IoT, the role of IoT cloud gateways in facilitating communication between IoT devices and the cloud has become more important than ever before. Most previous studies have focused on developing interoperability between IoT and cloud to accommodate various radio protocols. However, they have often neglected the performance aspect of the IoT cloud gateway, leaving users with limited options: either purchasing multiple gateways or connecting only a small number of IoT devices. Through our comprehensive measurements and analysis, we identified five key issues in IoT cloud gateways related to high latency, CPU bottlenecks, inefficient network stacks on ARM, substantial encryption overhead, and the lack of priority support. To address these issues, we propose a new IoT cloud gateway - SegaNet. We carefully design with 1) multiple agents management, 2) efficient TLS encryption, and 3) priority-oriented message delivery. Our prototype evaluation shows up to 16.7 × lower latency and 4.5 × lower CPU consumption than gateways of the existing IoT-cloud ecosystem.","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124240888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EasyQuantile: Efficient Quantile Tracking in the Data Plane EasyQuantile:有效的分位数跟踪在数据平面
Pub Date : 2023-06-29 DOI: 10.1145/3600061.3600084
Bo Wang, Rongqiang Chen, Lu Tang
Quantile tracking is an essential component of network measurement, where the tracked quantiles of the key performance metrics allow operators to better understand network performance. Given the high network speed and huge volume of traffic, the line-rate packet-processing performance and network visibility of programmable switches make it a trend to track quantiles in the programmable data plane. However, due to the rigorous resource constraints of programmable switches, quantile tracking is required to be both memory and computation efficient to be deployed in the data plane. In this paper, we present EasyQuantile, an efficient quantile tracking approach that has small constant memory usage and involves only hardware-friendly computations. EasyQuantile adopts an adjustable incremental update approach and calculates a pre-specified quantile with high accuracy entirely in the data plane. We implement EasyQuantile on Intel Tofino switches with small resource usage. Trace-driven experiments show that EasyQuantile achieves higher accuracy and lower complexities compared with state-of-the-art approaches.
分位数跟踪是网络测量的重要组成部分,其中关键性能指标的跟踪分位数使运营商能够更好地了解网络性能。由于网络速度快、业务量大,可编程交换机的线速率数据包处理性能和网络可见性使得在可编程数据平面上跟踪分位数成为一种趋势。然而,由于可编程交换机严格的资源限制,分位数跟踪要求在数据平面部署时既要提高内存效率,又要提高计算效率。在本文中,我们提出了一种有效的分位数跟踪方法EasyQuantile,它具有很小的恒定内存使用并且只涉及硬件友好的计算。EasyQuantile采用可调整的增量更新方法,完全在数据平面内以高精度计算预先指定的分位数。我们在Intel Tofino交换机上实现了EasyQuantile,占用了很少的资源。跟踪驱动实验表明,EasyQuantile与现有方法相比具有更高的精度和更低的复杂度。
{"title":"EasyQuantile: Efficient Quantile Tracking in the Data Plane","authors":"Bo Wang, Rongqiang Chen, Lu Tang","doi":"10.1145/3600061.3600084","DOIUrl":"https://doi.org/10.1145/3600061.3600084","url":null,"abstract":"Quantile tracking is an essential component of network measurement, where the tracked quantiles of the key performance metrics allow operators to better understand network performance. Given the high network speed and huge volume of traffic, the line-rate packet-processing performance and network visibility of programmable switches make it a trend to track quantiles in the programmable data plane. However, due to the rigorous resource constraints of programmable switches, quantile tracking is required to be both memory and computation efficient to be deployed in the data plane. In this paper, we present EasyQuantile, an efficient quantile tracking approach that has small constant memory usage and involves only hardware-friendly computations. EasyQuantile adopts an adjustable incremental update approach and calculates a pre-specified quantile with high accuracy entirely in the data plane. We implement EasyQuantile on Intel Tofino switches with small resource usage. Trace-driven experiments show that EasyQuantile achieves higher accuracy and lower complexities compared with state-of-the-art approaches.","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126920177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FlexCore: Leveraging XDP-SCTP for Scalable and Resilient Network Slice Service in Future 5G Core FlexCore:利用XDP-SCTP在未来5G核心中提供可扩展和弹性的网络切片服务
Pub Date : 2023-06-29 DOI: 10.1145/3600061.3600073
Bhavishya Sharma, Shwetha Vittal, Antony Franklin
The control plane of 5G Core (5GC) is typically shared among multiple dependent network slices of the data plane. But as the number of dependent slices and services on the common and shared control plane increases, its resilience threat also increases. This paper proposes FlexCore: a 5GC that is not only flexible and scalable but also resilient to cater to various service requirements on both stateful and stateless architectures of 5GC. FlexCore is built with an eXpress Data Path (XDP) and extended Berkeley Packet Filter (eBPF) based SCTP load balancer hooked at the entry point of the 3GPP compliant 5GC control plane, and a set of micro-AMF instances to serve the user requests. Precisely, the FlexCore is fabricated to honor the variety of incoming user requests on the control plane as per the service requirements, like, per slice, per user, or per control procedure of users too. Experiments on a 3GPP compliant 5G testbed show that FlexCore can provide average latency reduction of up to 14% and 79% on stateful and stateless architectures, respectively, and up to 63% latency reduction for latency-critical slices on the slice-aware architecture.
5G Core (5GC)的控制平面通常在数据平面的多个相关网络切片之间共享。但是,随着公共和共享控制平面上依赖的切片和服务数量的增加,其弹性威胁也在增加。本文提出FlexCore:一种不仅具有灵活性和可扩展性,而且具有弹性的5GC,可以满足5GC的有状态和无状态架构的各种服务需求。FlexCore采用eXpress Data Path (XDP)和基于SCTP负载均衡器的扩展Berkeley Packet Filter (eBPF)构建,该负载均衡器连接在符合3GPP的5GC控制平面的入口点,以及一组微amf实例来服务于用户请求。确切地说,FlexCore是为了根据服务需求来满足控制平面上的各种传入用户请求,比如每个切片、每个用户或每个用户的控制过程。在符合3GPP标准的5G测试平台上进行的实验表明,FlexCore可以在有状态和无状态架构上分别提供高达14%和79%的平均延迟减少,并且在切片感知架构上对延迟关键切片的延迟减少高达63%。
{"title":"FlexCore: Leveraging XDP-SCTP for Scalable and Resilient Network Slice Service in Future 5G Core","authors":"Bhavishya Sharma, Shwetha Vittal, Antony Franklin","doi":"10.1145/3600061.3600073","DOIUrl":"https://doi.org/10.1145/3600061.3600073","url":null,"abstract":"The control plane of 5G Core (5GC) is typically shared among multiple dependent network slices of the data plane. But as the number of dependent slices and services on the common and shared control plane increases, its resilience threat also increases. This paper proposes FlexCore: a 5GC that is not only flexible and scalable but also resilient to cater to various service requirements on both stateful and stateless architectures of 5GC. FlexCore is built with an eXpress Data Path (XDP) and extended Berkeley Packet Filter (eBPF) based SCTP load balancer hooked at the entry point of the 3GPP compliant 5GC control plane, and a set of micro-AMF instances to serve the user requests. Precisely, the FlexCore is fabricated to honor the variety of incoming user requests on the control plane as per the service requirements, like, per slice, per user, or per control procedure of users too. Experiments on a 3GPP compliant 5G testbed show that FlexCore can provide average latency reduction of up to 14% and 79% on stateful and stateless architectures, respectively, and up to 63% latency reduction for latency-critical slices on the slice-aware architecture.","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128626247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Unified, Flexible Framework in Network Topology Generation for Distributed Machine Learning 分布式机器学习网络拓扑生成的统一、灵活框架
Pub Date : 2023-06-29 DOI: 10.1145/3600061.3603132
Jianhao Liu, Xiaoyan Li, Yanhua Liu, Weibei Fan
In this study, we propose a unified framework for designing a class of server-centric network topologies for DML by adopting top-down design method and combinatorial design theory. Simulation results show that this flexible framework is capable of effectively supporting various DML tasks. Our framework can generate compatible topologies that meet various resource constraints and different DML tasks.
本文采用自顶向下的设计方法和组合设计理论,提出了一种统一的DML服务器中心网络拓扑设计框架。仿真结果表明,该框架能够有效地支持各种DML任务。我们的框架可以生成兼容的拓扑,以满足各种资源约束和不同的DML任务。
{"title":"A Unified, Flexible Framework in Network Topology Generation for Distributed Machine Learning","authors":"Jianhao Liu, Xiaoyan Li, Yanhua Liu, Weibei Fan","doi":"10.1145/3600061.3603132","DOIUrl":"https://doi.org/10.1145/3600061.3603132","url":null,"abstract":"In this study, we propose a unified framework for designing a class of server-centric network topologies for DML by adopting top-down design method and combinatorial design theory. Simulation results show that this flexible framework is capable of effectively supporting various DML tasks. Our framework can generate compatible topologies that meet various resource constraints and different DML tasks.","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131465620","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Training ChatGPT-like Models with In-network Computation 用网络内计算训练类chatgpt模型
Pub Date : 2023-06-29 DOI: 10.1145/3600061.3603136
Shuhao Fu, Yong Liao, Pengyuan Zhou
ChatGPT shows the enormous potential of large language models (LLMs). These models can easily reach the size of billions of parameters and create training difficulties for the majority. We propose a paradigm to train LLMs using distributed in-network computation on routers. Our preliminary result shows that our design allows LLMs to be trained at a reasonable learning rate without demanding extensive GPU resources.
ChatGPT显示了大型语言模型(llm)的巨大潜力。这些模型可以很容易地达到数十亿个参数的规模,并为大多数人带来训练困难。我们提出了一个在路由器上使用分布式网络内计算来训练llm的范例。我们的初步结果表明,我们的设计允许llm以合理的学习率进行训练,而不需要大量的GPU资源。
{"title":"Training ChatGPT-like Models with In-network Computation","authors":"Shuhao Fu, Yong Liao, Pengyuan Zhou","doi":"10.1145/3600061.3603136","DOIUrl":"https://doi.org/10.1145/3600061.3603136","url":null,"abstract":"ChatGPT shows the enormous potential of large language models (LLMs). These models can easily reach the size of billions of parameters and create training difficulties for the majority. We propose a paradigm to train LLMs using distributed in-network computation on routers. Our preliminary result shows that our design allows LLMs to be trained at a reasonable learning rate without demanding extensive GPU resources.","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131090488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diagnosing Distributed Routing Configurations Using Sequential Program Analysis 使用顺序程序分析诊断分布式路由配置
Pub Date : 2023-06-29 DOI: 10.1145/3600061.3600065
Xing Fang, Lizhao You, Qiao Xiang, Hanyang Shao, Gao Han, Ziyi Wang, J. Shu, L. Kong
In this paper, we show that by capturing the causal relationship among the computation of routers, one can transform the distributed program composed of routing processes into a sequential program, which allows the use of various sequential program analysis theories and tools for diagnosing and repairing routing configuration errors. This insight sheds light on future research on automatic network configuration diagnosis and repair. To demonstrate its feasibility and generality, we give the preliminary design of two methods for routing configuration error diagnosis: (1) data flow analysis using minimal unsatisfiable core and error invariants; and (2) control flow analysis using selective symbolic execution. Using real-world topologies and synthetic configurations, we show that both methods can effectively find errors in routing configurations while incurring reasonable overhead.
在本文中,我们展示了通过捕获路由器计算之间的因果关系,可以将路由过程组成的分布式程序转换为顺序程序,从而允许使用各种顺序程序分析理论和工具来诊断和修复路由配置错误。这一发现为未来网络配置自动诊断和修复的研究提供了启示。为了证明其可行性和通用性,我们给出了两种路由配置错误诊断方法的初步设计:(1)利用最小不满意核和误差不变量进行数据流分析;(2)选择性符号执行控制流分析。通过使用实际拓扑和合成配置,我们证明了这两种方法都可以有效地发现路由配置中的错误,同时产生合理的开销。
{"title":"Diagnosing Distributed Routing Configurations Using Sequential Program Analysis","authors":"Xing Fang, Lizhao You, Qiao Xiang, Hanyang Shao, Gao Han, Ziyi Wang, J. Shu, L. Kong","doi":"10.1145/3600061.3600065","DOIUrl":"https://doi.org/10.1145/3600061.3600065","url":null,"abstract":"In this paper, we show that by capturing the causal relationship among the computation of routers, one can transform the distributed program composed of routing processes into a sequential program, which allows the use of various sequential program analysis theories and tools for diagnosing and repairing routing configuration errors. This insight sheds light on future research on automatic network configuration diagnosis and repair. To demonstrate its feasibility and generality, we give the preliminary design of two methods for routing configuration error diagnosis: (1) data flow analysis using minimal unsatisfiable core and error invariants; and (2) control flow analysis using selective symbolic execution. Using real-world topologies and synthetic configurations, we show that both methods can effectively find errors in routing configurations while incurring reasonable overhead.","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130683950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
DFR: Dynamic-thresold Fault-tolerant Routing for Fat Tree DFR:胖树的动态阈值容错路由
Pub Date : 2023-06-29 DOI: 10.1145/3600061.3603125
Binyan Lan, Fei Lei, Ketong Wu, Dezun Dong
{"title":"DFR: Dynamic-thresold Fault-tolerant Routing for Fat Tree","authors":"Binyan Lan, Fei Lei, Ketong Wu, Dezun Dong","doi":"10.1145/3600061.3603125","DOIUrl":"https://doi.org/10.1145/3600061.3603125","url":null,"abstract":"","PeriodicalId":228934,"journal":{"name":"Proceedings of the 7th Asia-Pacific Workshop on Networking","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133022014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 7th Asia-Pacific Workshop on Networking
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1