Proceedings of the ACM on Measurement and Analysis of Computing Systems最新文献

A Large Scale Study and Classification of VirusTotal Reports on Phishing and Malware URLs 关于网络钓鱼和恶意软件 URL 的 VirusTotal 报告的大规模研究和分类

Proceedings of the ACM on Measurement and Analysis of Computing Systems

Pub Date : 2023-12-07 DOI: 10.1145/3626790

Euijin Choo, Mohamed Nabeel, Doowon Kim, Ravindu De Silva, Ting Yu, Issa Khalil

VirusTotal (VT) is a widely used scanning service for researchers and practitioners to label malicious entities and predict new security threats. Unfortunately, it is little known to the end-users how VT URL scanners decide on the maliciousness of entities and the attack types they are involved in (e.g., phishing or malware-hosting websites). In this paper, we conduct a systematic comparative study on VT URL scanners' behavior for different attack types of malicious URLs, in terms of 1) detection specialties, 2) stability, 3) correlations between scanners, and 4) lead/lag behaviors. Our findings highlight that the VT scanners commonly disagree with each other on their detection and attack type classification, leading to challenges in ascertaining the maliciousness of a URL and taking prompt mitigation actions according to different attack types. This motivates us to present a new highly accurate classifier that helps correctly identify the attack types of malicious URLs at the early stage. This in turn assists practitioners in performing better threat aggregation and choosing proper mitigation actions for different attack types

VirusTotal (VT) 是一种广泛使用的扫描服务，供研究人员和从业人员标记恶意实体并预测新的安全威胁。遗憾的是，最终用户对 VT URL 扫描仪如何判定实体的恶意程度及其参与的攻击类型（如网络钓鱼或恶意软件托管网站）知之甚少。在本文中，我们从以下几个方面对 VT URL 扫描仪针对不同攻击类型的恶意 URL 的行为进行了系统的比较研究：1）检测专业性；2）稳定性；3）扫描仪之间的相关性；4）领先/滞后行为。我们的研究结果表明，VT 扫描仪在检测和攻击类型分类方面普遍存在意见分歧，这给确定 URL 的恶意程度并根据不同攻击类型及时采取缓解措施带来了挑战。这促使我们提出一种新的高精度分类器，帮助在早期阶段正确识别恶意 URL 的攻击类型。这反过来又有助于从业人员更好地进行威胁汇总，并针对不同的攻击类型选择适当的缓解措施。

{"title":"A Large Scale Study and Classification of VirusTotal Reports on Phishing and Malware URLs","authors":"Euijin Choo, Mohamed Nabeel, Doowon Kim, Ravindu De Silva, Ting Yu, Issa Khalil","doi":"10.1145/3626790","DOIUrl":"https://doi.org/10.1145/3626790","url":null,"abstract":"VirusTotal (VT) is a widely used scanning service for researchers and practitioners to label malicious entities and predict new security threats. Unfortunately, it is little known to the end-users how VT URL scanners decide on the maliciousness of entities and the attack types they are involved in (e.g., phishing or malware-hosting websites). In this paper, we conduct a systematic comparative study on VT URL scanners' behavior for different attack types of malicious URLs, in terms of 1) detection specialties, 2) stability, 3) correlations between scanners, and 4) lead/lag behaviors. Our findings highlight that the VT scanners commonly disagree with each other on their detection and attack type classification, leading to challenges in ascertaining the maliciousness of a URL and taking prompt mitigation actions according to different attack types. This motivates us to present a new highly accurate classifier that helps correctly identify the attack types of malicious URLs at the early stage. This in turn assists practitioners in performing better threat aggregation and choosing proper mitigation actions for different attack types","PeriodicalId":426760,"journal":{"name":"Proceedings of the ACM on Measurement and Analysis of Computing Systems","volume":"97 ","pages":"1 - 26"},"PeriodicalIF":0.0,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139011791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

CoBF: Coordinated Beamforming in Dense mmWave Networks 密集毫米波网络中的协调波束形成

Proceedings of the ACM on Measurement and Analysis of Computing Systems

Pub Date : 2023-05-19 DOI: 10.1145/3589975

Ding Zhang, P. Santhalingam, P. Pathak, Zizhan Zheng

With MIMO and enhanced beamforming features, IEEE 802.11ay is poised to create the next generation of mmWave WLANs that can provide over 100 Gbps data rate. However, beamforming between densely deployed APs and clients incurs unacceptable overhead. On the other hand, the absence of up-to-date beamforming information restricts the diversity gains available through MIMO and multi-users, reducing the overall network capacity. This paper presents a novel approach of "coordinated beamforming" (called CoBF) where only a small subset of APs are selected for beamforming in the 802.11ay mmWave WLANs. Based on the concept of uncertainty, CoBF predicts the APs whose beamforming information is likely outdated and needs updating. The proposed approach complements the existing per-link beamforming solutions and extends their effectiveness from link-level to network-level. Furthermore, CoBF leverages the AP uncertainty to create MU-MIMO groups through interference-aware scheduling in 802.11ay WLANs. With extensive experimentation and simulations, we show that CoBF can significantly reduce beamforming overhead and improve network capacity for 802.11ay WLANs.

凭借MIMO和增强的波束形成功能，IEEE 802.11ay准备创建下一代毫米波wlan，可以提供超过100 Gbps的数据速率。然而，密集部署的ap和客户端之间的波束形成会产生不可接受的开销。另一方面，缺乏最新的波束形成信息限制了通过MIMO和多用户获得的分集增益，降低了整体网络容量。本文提出了一种新的“协调波束形成”(称为CoBF)方法，在802.11ay毫米波wlan中，只有一小部分ap被选择进行波束形成。CoBF基于不确定性的概念，对波束形成信息可能过时、需要更新的ap进行预测。该方法补充了现有的逐链路波束形成解决方案，并将其有效性从链路级扩展到网络级。此外，CoBF利用AP不确定性在802.11ay wlan中通过干扰感知调度创建MU-MIMO组。通过大量的实验和模拟，我们表明CoBF可以显著降低波束形成开销并提高802.11ay wlan的网络容量。

{"title":"CoBF: Coordinated Beamforming in Dense mmWave Networks","authors":"Ding Zhang, P. Santhalingam, P. Pathak, Zizhan Zheng","doi":"10.1145/3589975","DOIUrl":"https://doi.org/10.1145/3589975","url":null,"abstract":"With MIMO and enhanced beamforming features, IEEE 802.11ay is poised to create the next generation of mmWave WLANs that can provide over 100 Gbps data rate. However, beamforming between densely deployed APs and clients incurs unacceptable overhead. On the other hand, the absence of up-to-date beamforming information restricts the diversity gains available through MIMO and multi-users, reducing the overall network capacity. This paper presents a novel approach of \"coordinated beamforming\" (called CoBF) where only a small subset of APs are selected for beamforming in the 802.11ay mmWave WLANs. Based on the concept of uncertainty, CoBF predicts the APs whose beamforming information is likely outdated and needs updating. The proposed approach complements the existing per-link beamforming solutions and extends their effectiveness from link-level to network-level. Furthermore, CoBF leverages the AP uncertainty to create MU-MIMO groups through interference-aware scheduling in 802.11ay WLANs. With extensive experimentation and simulations, we show that CoBF can significantly reduce beamforming overhead and improve network capacity for 802.11ay WLANs.","PeriodicalId":426760,"journal":{"name":"Proceedings of the ACM on Measurement and Analysis of Computing Systems","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127640297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Smash: Flexible, Fast, and Resource-efficient Placement and Lookup of Distributed Storage Smash:灵活、快速、资源高效的分布式存储放置和查找

Proceedings of the ACM on Measurement and Analysis of Computing Systems

Pub Date : 2023-05-19 DOI: 10.1145/3589977

Yi Liu, Shouqian Shi, Minghao Xie, Heiner Litz, Chen Qian

Large-scale distributed storage systems, such as object stores, usually apply hashing-based placement and lookup methods to achieve scalability and resource efficiency. However, when object locations are determined by hash values, placement becomes inflexible, failing to optimize or satisfy application requirements such as load balance, failure tolerance, parallelism, and network/system performance. This work presents a novel solution to achieve the best of two worlds: flexibility while maintaining cost-effectiveness and scalability. The proposed method Smash is an object placement and lookup method that achieves full placement flexibility, balanced load, low resource cost, and short latency. Smash utilizes a recent space-efficient data structure and applies it to object-location lookups. We implement Smash as a prototype system and evaluate it in a public cloud. The analysis and experimental results show that Smash achieves full placement flexibility, fast storage operations, fast recovery from node dynamics, and lower DRAM cost (<60%) compared to existing hash-based solutions such as Ceph and MapX.

大型分布式存储系统(如对象存储)通常应用基于散列的放置和查找方法来实现可伸缩性和资源效率。但是，当对象位置由哈希值决定时，位置就变得不灵活，无法优化或满足应用程序需求，例如负载平衡、容错、并行性和网络/系统性能。这项工作提供了一种新颖的解决方案，以实现两个世界的最佳效果:灵活性，同时保持成本效益和可伸缩性。提出的Smash方法是一种对象放置和查找方法，实现了完全的放置灵活性、负载均衡、资源成本低、延迟短。Smash利用最新的空间高效数据结构，并将其应用于对象位置查找。我们将Smash作为原型系统实现，并在公共云中对其进行评估。分析和实验结果表明，与现有的基于hash的解决方案(如Ceph和MapX)相比，Smash实现了完全的放置灵活性、快速的存储操作、从节点动态中快速恢复以及更低的DRAM成本(<60%)。

{"title":"Smash: Flexible, Fast, and Resource-efficient Placement and Lookup of Distributed Storage","authors":"Yi Liu, Shouqian Shi, Minghao Xie, Heiner Litz, Chen Qian","doi":"10.1145/3589977","DOIUrl":"https://doi.org/10.1145/3589977","url":null,"abstract":"Large-scale distributed storage systems, such as object stores, usually apply hashing-based placement and lookup methods to achieve scalability and resource efficiency. However, when object locations are determined by hash values, placement becomes inflexible, failing to optimize or satisfy application requirements such as load balance, failure tolerance, parallelism, and network/system performance. This work presents a novel solution to achieve the best of two worlds: flexibility while maintaining cost-effectiveness and scalability. The proposed method Smash is an object placement and lookup method that achieves full placement flexibility, balanced load, low resource cost, and short latency. Smash utilizes a recent space-efficient data structure and applies it to object-location lookups. We implement Smash as a prototype system and evaluate it in a public cloud. The analysis and experimental results show that Smash achieves full placement flexibility, fast storage operations, fast recovery from node dynamics, and lower DRAM cost (<60%) compared to existing hash-based solutions such as Ceph and MapX.","PeriodicalId":426760,"journal":{"name":"Proceedings of the ACM on Measurement and Analysis of Computing Systems","volume":"142 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122800749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Real-time Spread Burst Detection in Data Streaming 数据流中的实时扩展突发检测

Proceedings of the ACM on Measurement and Analysis of Computing Systems

Pub Date : 2023-05-19 DOI: 10.1145/3589979

Haibo Wang, D. Melissourgos, Chaoyi Ma, Shigang Chen

Data streaming has many applications in network monitoring, web services, e-commerce, stock trading, social networks, and distributed sensing. This paper introduces a new problem of real-time burst detection in flow spread, which differs from the traditional problem of burst detection in flow size. It is practically significant with potential applications in cybersecurity, network engineering, and trend identification on the Internet. It is a challenging problem because estimating flow spread requires us to remember all past data items and detecting bursts in real time requires us to minimize spread estimation overhead, which was not the priority in most prior work. This paper provides the first efficient, real-time solution for spread burst detection. It is designed based on a new real-time super spreader identifier, which outperforms the state of the art in terms of both accuracy and processing overhead. The super spreader identifier is in turn based on a new sketch design for real-time spread estimation, which outperforms the best existing sketches.

数据流在网络监控、web服务、电子商务、股票交易、社交网络和分布式传感等领域有着广泛的应用。本文提出了一种新的基于流量扩展的突发实时检测问题，它不同于传统的基于流量大小的突发实时检测问题。它在网络安全、网络工程和互联网趋势识别方面具有潜在的应用价值。这是一个具有挑战性的问题，因为估计流量扩散需要我们记住所有过去的数据项，而实时检测突发需要我们最小化扩散估计开销，而这在大多数先前的工作中并不是优先考虑的。本文提供了第一个有效、实时的扩展突发检测解决方案。它是基于一种新的实时超级扩展标识符设计的，该标识符在精度和处理开销方面都优于目前的状态。超级传播标识符基于一种新的实时传播估计草图设计，优于现有的最佳草图。

引用次数: 1

SplitRPC: A {Control + Data} Path Splitting RPC Stack for ML Inference Serving SplitRPC:一个用于ML推理服务的{Control + Data}路径分裂RPC堆栈

Proceedings of the ACM on Measurement and Analysis of Computing Systems

Pub Date : 2023-05-19 DOI: 10.1145/3589974

Adithya Kumar, A. Sivasubramaniam, T. Zhu

The growing adoption of hardware accelerators driven by their intelligent compiler and runtime system counterparts has democratized ML services and precipitously reduced their execution times. This motivates us to shift our attention to efficiently serve these ML services under distributed settings and characterize the overheads imposed by the RPC mechanism ('RPC tax') when serving them on accelerators. The RPC implementations designed over the years implicitly assume the host CPU services the requests, and we focus on expanding such works towards accelerator-based services. While recent proposals calling for SmartNICs to take on this task are reasonable for simple kernels, serving complex ML models requires a more nuanced view to optimize both the data-path and the control/orchestration of these accelerators. We program today's commodity network interface cards (NICs) to split the control and data paths for effective transfer of control while efficiently transferring the payload to the accelerator. As opposed to unified approaches that bundle these paths together, limiting the flexibility in each of these paths, we design and implement SplitRPC - a control + data path optimizing RPC mechanism for ML inference serving. SplitRPC allows us to optimize the datapath to the accelerator while simultaneously allowing the CPU to maintain full orchestration capabilities. We implement SplitRPC on both commodity NICs and SmartNICs and demonstrate how GPU-based ML services running different compiler/runtime systems can benefit. For a variety of ML models served using different inference runtimes, we demonstrate that SplitRPC is effective in minimizing the RPC tax while providing significant gains in throughput and latency over existing kernel by-pass approaches, without requiring expensive SmartNIC devices.

由智能编译器和运行时系统驱动的硬件加速器的日益普及，使ML服务大众化，并大大缩短了它们的执行时间。这促使我们将注意力转移到在分布式设置下有效地服务这些ML服务，并描述RPC机制在加速器上服务时强加的开销(“RPC税”)。多年来设计的RPC实现隐式地假设主机CPU为请求提供服务，我们将重点放在将此类工作扩展到基于加速器的服务上。虽然最近的建议要求smartnic承担这项任务对于简单的内核来说是合理的，但服务复杂的ML模型需要更细致的视角来优化这些加速器的数据路径和控制/编组。我们对今天的商品网络接口卡(nic)进行编程，以分离控制和数据路径，以便有效地传输控制，同时有效地将有效载荷传输到加速器。与将这些路径捆绑在一起的统一方法相反，限制了每个路径的灵活性，我们设计并实现了SplitRPC——一种用于ML推理服务的控制+数据路径优化RPC机制。SplitRPC允许我们优化到加速器的数据路径，同时允许CPU保持完整的编排功能。我们在商品nic和smartnic上实现SplitRPC，并演示运行不同编译器/运行时系统的基于gpu的ML服务如何受益。对于使用不同推理运行时服务的各种ML模型，我们证明SplitRPC在最小化RPC税方面是有效的，同时与现有的内核旁路方法相比，在吞吐量和延迟方面提供了显着的收益，而不需要昂贵的SmartNIC设备。

{"title":"SplitRPC: A {Control + Data} Path Splitting RPC Stack for ML Inference Serving","authors":"Adithya Kumar, A. Sivasubramaniam, T. Zhu","doi":"10.1145/3589974","DOIUrl":"https://doi.org/10.1145/3589974","url":null,"abstract":"The growing adoption of hardware accelerators driven by their intelligent compiler and runtime system counterparts has democratized ML services and precipitously reduced their execution times. This motivates us to shift our attention to efficiently serve these ML services under distributed settings and characterize the overheads imposed by the RPC mechanism ('RPC tax') when serving them on accelerators. The RPC implementations designed over the years implicitly assume the host CPU services the requests, and we focus on expanding such works towards accelerator-based services. While recent proposals calling for SmartNICs to take on this task are reasonable for simple kernels, serving complex ML models requires a more nuanced view to optimize both the data-path and the control/orchestration of these accelerators. We program today's commodity network interface cards (NICs) to split the control and data paths for effective transfer of control while efficiently transferring the payload to the accelerator. As opposed to unified approaches that bundle these paths together, limiting the flexibility in each of these paths, we design and implement SplitRPC - a control + data path optimizing RPC mechanism for ML inference serving. SplitRPC allows us to optimize the datapath to the accelerator while simultaneously allowing the CPU to maintain full orchestration capabilities. We implement SplitRPC on both commodity NICs and SmartNICs and demonstrate how GPU-based ML services running different compiler/runtime systems can benefit. For a variety of ML models served using different inference runtimes, we demonstrate that SplitRPC is effective in minimizing the RPC tax while providing significant gains in throughput and latency over existing kernel by-pass approaches, without requiring expensive SmartNIC devices.","PeriodicalId":426760,"journal":{"name":"Proceedings of the ACM on Measurement and Analysis of Computing Systems","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122040162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Towards Accelerating Data Intensive Application's Shuffle Process Using SmartNICs 利用smartnic加速数据密集型应用的Shuffle进程

Proceedings of the ACM on Measurement and Analysis of Computing Systems

Pub Date : 2023-05-19 DOI: 10.1145/3589980

Jia-Jen Lin, T. Ji, Xiangpeng Hao, Hokeun Cha, Yanfang Le, Xiangyao Yu, Aditya Akella

The wide adoption of the emerging SmartNIC technology creates new opportunities to offload application-level computation into the networking layer, which frees the burden of host CPUs, leading to performance improvement. Shuffle, the all-to-all data exchange process, is a critical building block for network communication in distributed data-intensive applications and can potentially benefit from SmartNICs. In this paper, we develop SmartShuffle, which accelerates the data-intensive application's shuffle process by offloading various computation tasks into the SmartNIC devices. SmartShuffle supports offloading both low-level network functions, including data partitioning and network transport, and high-level computation tasks, including filtering, aggregation, and sorting. SmartShuffle adopts a coordinated offload architecture to make sender-side and receiver-side SmartNICs jointly contribute to the benefits of shuffle computation offload. SmartShuffle carefully manages the tight and time-varying computation and memory constraints on the device. We propose a liquid offloading approach, which dynamically migrates operators between the host CPU and the SmartNIC at runtime such that resources in both devices are fully utilized. We prototype SmartShuffle on the Stingray SoC SmartNICs and plug it into Spark. Our evaluation shows that SmartShuffle improves host CPU efficiency and I/O efficiency with lower job completion time. SmartShuffle outperforms Spark, and Spark RDMA by up to 40% on TPC-H.

新兴SmartNIC技术的广泛采用为将应用级计算转移到网络层创造了新的机会，从而减轻了主机cpu的负担，从而提高了性能。Shuffle是全对全数据交换过程，是分布式数据密集型应用中网络通信的关键组成部分，可以从smartnic中获益。在本文中，我们开发了SmartShuffle，它通过将各种计算任务卸载到SmartNIC设备中来加速数据密集型应用程序的shuffle过程。SmartShuffle既可以卸载数据分区、网络传输等底层网络功能，也可以卸载过滤、聚合、排序等高层计算任务。SmartShuffle采用协调的分流架构，使发送端和接收端smartnic共同实现shuffle计算分流的好处。SmartShuffle仔细地管理设备上紧的和时变的计算和内存约束。我们提出了一种液体卸载方法，该方法在运行时在主机CPU和SmartNIC之间动态迁移操作符，从而充分利用两个设备中的资源。我们在Stingray SoC smartnic上原型化SmartShuffle，并将其插入Spark。我们的评估表明，SmartShuffle提高了主机CPU效率和I/O效率，同时缩短了作业完成时间。在TPC-H上，SmartShuffle的性能比Spark和Spark RDMA高出40%。

{"title":"Towards Accelerating Data Intensive Application's Shuffle Process Using SmartNICs","authors":"Jia-Jen Lin, T. Ji, Xiangpeng Hao, Hokeun Cha, Yanfang Le, Xiangyao Yu, Aditya Akella","doi":"10.1145/3589980","DOIUrl":"https://doi.org/10.1145/3589980","url":null,"abstract":"The wide adoption of the emerging SmartNIC technology creates new opportunities to offload application-level computation into the networking layer, which frees the burden of host CPUs, leading to performance improvement. Shuffle, the all-to-all data exchange process, is a critical building block for network communication in distributed data-intensive applications and can potentially benefit from SmartNICs. In this paper, we develop SmartShuffle, which accelerates the data-intensive application's shuffle process by offloading various computation tasks into the SmartNIC devices. SmartShuffle supports offloading both low-level network functions, including data partitioning and network transport, and high-level computation tasks, including filtering, aggregation, and sorting. SmartShuffle adopts a coordinated offload architecture to make sender-side and receiver-side SmartNICs jointly contribute to the benefits of shuffle computation offload. SmartShuffle carefully manages the tight and time-varying computation and memory constraints on the device. We propose a liquid offloading approach, which dynamically migrates operators between the host CPU and the SmartNIC at runtime such that resources in both devices are fully utilized. We prototype SmartShuffle on the Stingray SoC SmartNICs and plug it into Spark. Our evaluation shows that SmartShuffle improves host CPU efficiency and I/O efficiency with lower job completion time. SmartShuffle outperforms Spark, and Spark RDMA by up to 40% on TPC-H.","PeriodicalId":426760,"journal":{"name":"Proceedings of the ACM on Measurement and Analysis of Computing Systems","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123640944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Memtrade: Marketplace for Disaggregated Memory Clouds Memtrade:分解内存云的市场

Proceedings of the ACM on Measurement and Analysis of Computing Systems

Pub Date : 2023-05-19 DOI: 10.1145/3589985

H. Maruf, Yuhong Zhong, Hongyi Wang, Mosharaf Chowdhury, Asaf Cidon, Carl A. Waldspurger

We present Memtrade, the first practical marketplace for disaggregated memory clouds. Clouds introduce a set of unique challenges for resource disaggregation across different tenants, including resource harvesting, isolation, and matching. Memtrade allows producer virtual machines (VMs) to lease both their unallocated memory and allocated-but-idle application memory to remote consumer VMs for a limited period of time. Memtrade does not require any modifications to host-level system software or support from the cloud provider. It harvests producer memory using an application-aware control loop to form a distributed transient remote memory pool with minimal performance impact; it employs a broker to match producers with consumers while satisfying performance constraints; and it exposes the matched memory to consumers through different abstractions. As a proof of concept, we propose two such memory access interfaces for Memtrade consumers -- a transient KV cache for specified applications and a swap interface that is application-transparent. Our evaluation using real-world cluster traces shows that Memtrade provides significant performance benefit for consumers (improving average read latency up to 2.8X) while preserving confidentiality and integrity, with little impact on producer applications (degrading performance by less than 2.1%).

我们介绍Memtrade，这是第一个用于分解内存云的实用市场。云为跨不同租户的资源分解带来了一系列独特的挑战，包括资源收集、隔离和匹配。Memtrade允许生产者虚拟机(vm)在有限的时间内将其未分配的内存和已分配但空闲的应用程序内存租赁给远程消费者虚拟机。Memtrade不需要对主机级系统软件进行任何修改，也不需要云提供商的支持。它使用应用程序感知控制循环来获取生产者内存，形成一个对性能影响最小的分布式瞬态远程内存池;它使用一个中介来匹配生产者和消费者，同时满足性能约束;它通过不同的抽象将匹配的内存公开给消费者。作为概念验证，我们为Memtrade消费者提出了两个这样的内存访问接口——一个用于指定应用程序的瞬时KV缓存和一个应用程序透明的交换接口。我们使用真实集群跟踪进行的评估表明，Memtrade为消费者提供了显著的性能优势(将平均读取延迟提高了2.8倍)，同时保持了机密性和完整性，对生产者应用程序的影响很小(性能降低不到2.1%)。

{"title":"Memtrade: Marketplace for Disaggregated Memory Clouds","authors":"H. Maruf, Yuhong Zhong, Hongyi Wang, Mosharaf Chowdhury, Asaf Cidon, Carl A. Waldspurger","doi":"10.1145/3589985","DOIUrl":"https://doi.org/10.1145/3589985","url":null,"abstract":"We present Memtrade, the first practical marketplace for disaggregated memory clouds. Clouds introduce a set of unique challenges for resource disaggregation across different tenants, including resource harvesting, isolation, and matching. Memtrade allows producer virtual machines (VMs) to lease both their unallocated memory and allocated-but-idle application memory to remote consumer VMs for a limited period of time. Memtrade does not require any modifications to host-level system software or support from the cloud provider. It harvests producer memory using an application-aware control loop to form a distributed transient remote memory pool with minimal performance impact; it employs a broker to match producers with consumers while satisfying performance constraints; and it exposes the matched memory to consumers through different abstractions. As a proof of concept, we propose two such memory access interfaces for Memtrade consumers -- a transient KV cache for specified applications and a swap interface that is application-transparent. Our evaluation using real-world cluster traces shows that Memtrade provides significant performance benefit for consumers (improving average read latency up to 2.8X) while preserving confidentiality and integrity, with little impact on producer applications (degrading performance by less than 2.1%).","PeriodicalId":426760,"journal":{"name":"Proceedings of the ACM on Measurement and Analysis of Computing Systems","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126013072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

POMACS V7, N2, June 2023 Editorial 《POMACS》V7, N2, 2023年6月社论

Proceedings of the ACM on Measurement and Analysis of Computing Systems

Pub Date : 2023-05-19 DOI: 10.1145/3589972

K. Avrachenkov, P. Gill, B. Urgaonkar

The Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS) focuses on the measurement and performance evaluation of computer systems and operates in close collaboration with the ACM Special Interest Group SIGMETRICS. All papers in this issue of POMACS will be presented at the ACM SIGMETRICS 2023 conference on June 19-23, 2023, in Orlando, Florida, USA. These papers have been selected during the winter submission round by the 91 members of the ACM SIGMETRICS 2023 program committee via a rigorous review process. Each paper was conditionally accepted (and shepherded), allowed a "one-shot" revision (to be resubmitted to one of the subsequent three SIGMETRICS deadlines), or rejected (with re-submission allowed after a year). For this issue, which represents the winter deadline, POMACS is publishing 11 papers out of 130 submissions. All submissions received at least 3 reviews and borderline cases were extensively discussed during the online program committee meeting. Based on the indicated track(s), roughly 28% of the submissions were in the Theory track, 44% were in the Measurement & Applied Modeling track, 43% were in the Systems track, and 22% were in the Learning track. Many individuals contributed to the success of this issue of POMACS. First, we would like to thank the authors, who submitted their best work to SIGMETRICS/POMACS. Second, we would like to thank the program committee members who provided constructive feedback in their reviews to authors and participated in the online discussions and program committee meeting. We also thank the several external reviewers who provided their expert opinion on specific submissions that required additional input. We are also grateful to the SIGMETRICS Board Chair, Giuliano Casale, and to past program committee Chairs, Niklas Carlsson, Edith Cohen, and Philippe Robert, who provided a wealth of information and guidance. Finally, we are grateful to the Organization Committee and to the SIGMETRICS Board for their ongoing efforts and initiatives for creating an exciting program for ACM SIGMETRICS 2023.

ACM计算系统测量与分析论文集(POMACS)侧重于计算机系统的测量和性能评估，并与ACM特别兴趣小组SIGMETRICS密切合作。本期《POMACS》的所有论文将于2023年6月19日至23日在美国佛罗里达州奥兰多举行的ACM SIGMETRICS 2023会议上发表。这些论文是由ACM SIGMETRICS 2023项目委员会的91名成员通过严格的审查程序在冬季提交的一轮中选出的。每篇论文都被有条件地接受(和指导)，允许“一次性”修改(在随后的三个SIGMETRICS截止日期之一重新提交)，或者拒绝(在一年后允许重新提交)。这期是冬季的最后期限，POMACS发表了130篇投稿中的11篇。所有提交的作品都至少接受了3次评审，在在线项目委员会会议上，对边缘性案例进行了广泛讨论。根据所指示的轨道，大约28%的提交在理论轨道，44%在测量和应用建模轨道，43%在系统轨道，22%在学习轨道。许多人对本期《POMACS》的成功做出了贡献。首先，我们要感谢作者，他们向SIGMETRICS/POMACS提交了他们最好的作品。其次，我们要感谢项目委员会成员，他们在对作者的评审中提供了建设性的反馈，并参与了在线讨论和项目委员会会议。我们还要感谢几位外部审稿人，他们就需要额外投入的具体提交文件提供了专家意见。我们还要感谢SIGMETRICS董事会主席Giuliano Casale，以及过去的项目委员会主席Niklas Carlsson、Edith Cohen和Philippe Robert，他们提供了丰富的信息和指导。最后，我们感谢组织委员会和SIGMETRICS董事会为ACM SIGMETRICS 2023创建一个令人兴奋的项目所做的持续努力和倡议。

{"title":"POMACS V7, N2, June 2023 Editorial","authors":"K. Avrachenkov, P. Gill, B. Urgaonkar","doi":"10.1145/3589972","DOIUrl":"https://doi.org/10.1145/3589972","url":null,"abstract":"The Proceedings of the ACM on Measurement and Analysis of Computing Systems (POMACS) focuses on the measurement and performance evaluation of computer systems and operates in close collaboration with the ACM Special Interest Group SIGMETRICS. All papers in this issue of POMACS will be presented at the ACM SIGMETRICS 2023 conference on June 19-23, 2023, in Orlando, Florida, USA. These papers have been selected during the winter submission round by the 91 members of the ACM SIGMETRICS 2023 program committee via a rigorous review process. Each paper was conditionally accepted (and shepherded), allowed a \"one-shot\" revision (to be resubmitted to one of the subsequent three SIGMETRICS deadlines), or rejected (with re-submission allowed after a year). For this issue, which represents the winter deadline, POMACS is publishing 11 papers out of 130 submissions. All submissions received at least 3 reviews and borderline cases were extensively discussed during the online program committee meeting. Based on the indicated track(s), roughly 28% of the submissions were in the Theory track, 44% were in the Measurement & Applied Modeling track, 43% were in the Systems track, and 22% were in the Learning track. Many individuals contributed to the success of this issue of POMACS. First, we would like to thank the authors, who submitted their best work to SIGMETRICS/POMACS. Second, we would like to thank the program committee members who provided constructive feedback in their reviews to authors and participated in the online discussions and program committee meeting. We also thank the several external reviewers who provided their expert opinion on specific submissions that required additional input. We are also grateful to the SIGMETRICS Board Chair, Giuliano Casale, and to past program committee Chairs, Niklas Carlsson, Edith Cohen, and Philippe Robert, who provided a wealth of information and guidance. Finally, we are grateful to the Organization Committee and to the SIGMETRICS Board for their ongoing efforts and initiatives for creating an exciting program for ACM SIGMETRICS 2023.","PeriodicalId":426760,"journal":{"name":"Proceedings of the ACM on Measurement and Analysis of Computing Systems","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114352070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

JS Capsules: A Framework for Capturing Fine-grained JavaScript Memory Measurements for the Mobile Web JS胶囊:一个为移动Web捕获细粒度JavaScript内存测量的框架

Proceedings of the ACM on Measurement and Analysis of Computing Systems

Pub Date : 2023-02-27 DOI: 10.1145/3579327

Usama Naseer, Theophilus A. Benson

Understanding the resource consumption of the mobile web is an important topic that has garnered much attention in recent years. However, existing works mostly focus on the networking or computational aspects of the mobile web and largely ignore memory, which is an important aspect given the mobile web's reliance on resource-heavy JavaScript. In this paper, we propose a framework, called JS Capsules, for characterizing the memory of JavaScript functions and, using this framework, we investigate the key browser mechanics that contribute to the memory overhead. Leveraging our framework on a testbed of Android mobile phones, we conduct measurements of the Alexa top 1K websites. While most existing frameworks focus on V8 - the JavaScript engine used in most popular browsers - in the context of memory, our measurements show that the memory implications of JavaScript extends far beyond V8 due to the cascading effects that certain JavaScript calls have on the browser's rendering mechanics. We quantify and highlight the direct impact that website DOM have on JavaScript memory overhead and present, to our knowledge, the first root-cause analysis to dissect and characterize their impact on JavaScript memory overheads.

了解移动网络的资源消耗是近年来备受关注的一个重要话题。然而，现有的工作大多集中在移动web的网络或计算方面，而很大程度上忽略了内存，这是一个重要的方面，因为移动web依赖于资源密集型的JavaScript。在本文中，我们提出了一个框架，称为JS胶囊，用于表征JavaScript函数的内存，并使用该框架，我们研究了导致内存开销的关键浏览器机制。在Android手机的测试平台上利用我们的框架，我们对Alexa排名前1K的网站进行了测量。虽然大多数现有框架都关注V8——大多数流行浏览器中使用的JavaScript引擎——在内存环境中，我们的测量表明，由于某些JavaScript调用对浏览器渲染机制的级联效应，JavaScript对内存的影响远远超出了V8。我们量化并强调了网站DOM对JavaScript内存开销的直接影响，据我们所知，这是第一个剖析和描述其对JavaScript内存开销影响的根本原因分析。

{"title":"JS Capsules: A Framework for Capturing Fine-grained JavaScript Memory Measurements for the Mobile Web","authors":"Usama Naseer, Theophilus A. Benson","doi":"10.1145/3579327","DOIUrl":"https://doi.org/10.1145/3579327","url":null,"abstract":"Understanding the resource consumption of the mobile web is an important topic that has garnered much attention in recent years. However, existing works mostly focus on the networking or computational aspects of the mobile web and largely ignore memory, which is an important aspect given the mobile web's reliance on resource-heavy JavaScript. In this paper, we propose a framework, called JS Capsules, for characterizing the memory of JavaScript functions and, using this framework, we investigate the key browser mechanics that contribute to the memory overhead. Leveraging our framework on a testbed of Android mobile phones, we conduct measurements of the Alexa top 1K websites. While most existing frameworks focus on V8 - the JavaScript engine used in most popular browsers - in the context of memory, our measurements show that the memory implications of JavaScript extends far beyond V8 due to the cascading effects that certain JavaScript calls have on the browser's rendering mechanics. We quantify and highlight the direct impact that website DOM have on JavaScript memory overhead and present, to our knowledge, the first root-cause analysis to dissect and characterize their impact on JavaScript memory overheads.","PeriodicalId":426760,"journal":{"name":"Proceedings of the ACM on Measurement and Analysis of Computing Systems","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130087289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Detecting and Measuring Security Risks of Hosting-Based Dangling Domains 基于主机的悬空域安全风险检测与度量

Proceedings of the ACM on Measurement and Analysis of Computing Systems

Pub Date : 2023-02-27 DOI: 10.1145/3579440

Mingming Zhang, Xiang Li, Baojun Liu, Jianyu Lu, Yiming Zhang, Jianjun Chen, Haixin Duan, S. Hao, Xiaofeng Zheng

Public hosting services provide convenience for domain owners to build web applications with better scalability and security. However, if a domain name points to released service endpoints (e.g., nameservers allocated by a provider), adversaries can take over the domain by applying the same endpoints. Such a security threat is called "hosting-based domain takeover''. In recent years, a large number of domain takeover incidents have occurred; even well-known websites like the subdomains of microsoft.com have been impacted. However, until now, there has been no effective detection system to identify these vulnerable domains on a large scale. In this paper, we fill this research gap by presenting a novel framework, HostingChecker, for detecting domain takeovers. Compared with previous work, HostingChecker expands the detection scope and improves the detection efficiency by: (i) systematically identifying vulnerable hosting services using a semi-automated method; and (ii) effectively detecting vulnerable domains through passive reconstruction of domain dependency chains. The framework enables us to detect the subdomains of Tranco sites on a daily basis. We evaluate the effectiveness of HostingChecker and eventually detect 10,351 subdomains from Tranco Top-1M apex domains vulnerable to domain takeover, which are over 8× more than previous findings. Furthermore, we conduct an in-depth security analysis on the affected vendors, like Amazon and Alibaba, and gain a suite of new insights, including flawed implementation of domain ownership validation. Following responsible disclosure processes, we have reported issues to the security response centers of affected vendors, and some (e.g., Baidu and Tencent) have adopted our mitigation.

公共托管服务为域名所有者构建具有更好可扩展性和安全性的web应用程序提供了便利。但是，如果域名指向已释放的服务端点(例如，由提供商分配的名称服务器)，攻击者可以通过应用相同的端点来接管该域。这种安全威胁被称为“基于主机的域名接管”。近年来，发生了大量域名接管事件;甚至像microsoft.com的子域名这样的知名网站也受到了影响。然而，到目前为止，还没有有效的检测系统来大规模地识别这些脆弱域。在本文中，我们通过提出一个新的框架HostingChecker来填补这一研究空白，该框架用于检测域名接管。与以往的工作相比，HostingChecker扩展了检测范围，提高了检测效率:(1)采用半自动化的方法系统地识别易受攻击的托管服务;(ii)通过被动重构领域依赖链，有效检测脆弱领域。该框架使我们能够每天检测Tranco站点的子域。我们评估了HostingChecker的有效性，最终从Tranco Top-1M顶级域名中检测出10,351个子域名易被域名接管，比之前的发现多出8倍以上。此外，我们对受影响的供应商(如亚马逊和阿里巴巴)进行了深入的安全分析，并获得了一套新的见解，包括域所有权验证的错误实现。根据负责任的披露流程，我们已经向受影响供应商的安全响应中心报告了问题，一些(例如百度和腾讯)已经采用了我们的缓解措施。

{"title":"Detecting and Measuring Security Risks of Hosting-Based Dangling Domains","authors":"Mingming Zhang, Xiang Li, Baojun Liu, Jianyu Lu, Yiming Zhang, Jianjun Chen, Haixin Duan, S. Hao, Xiaofeng Zheng","doi":"10.1145/3579440","DOIUrl":"https://doi.org/10.1145/3579440","url":null,"abstract":"Public hosting services provide convenience for domain owners to build web applications with better scalability and security. However, if a domain name points to released service endpoints (e.g., nameservers allocated by a provider), adversaries can take over the domain by applying the same endpoints. Such a security threat is called \"hosting-based domain takeover''. In recent years, a large number of domain takeover incidents have occurred; even well-known websites like the subdomains of microsoft.com have been impacted. However, until now, there has been no effective detection system to identify these vulnerable domains on a large scale. In this paper, we fill this research gap by presenting a novel framework, HostingChecker, for detecting domain takeovers. Compared with previous work, HostingChecker expands the detection scope and improves the detection efficiency by: (i) systematically identifying vulnerable hosting services using a semi-automated method; and (ii) effectively detecting vulnerable domains through passive reconstruction of domain dependency chains. The framework enables us to detect the subdomains of Tranco sites on a daily basis. We evaluate the effectiveness of HostingChecker and eventually detect 10,351 subdomains from Tranco Top-1M apex domains vulnerable to domain takeover, which are over 8× more than previous findings. Furthermore, we conduct an in-depth security analysis on the affected vendors, like Amazon and Alibaba, and gain a suite of new insights, including flawed implementation of domain ownership validation. Following responsible disclosure processes, we have reported issues to the security response centers of affected vendors, and some (e.g., Baidu and Tencent) have adopted our mitigation.","PeriodicalId":426760,"journal":{"name":"Proceedings of the ACM on Measurement and Analysis of Computing Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131255681","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1