Monica Chiosa, Thomas B. Preußer, Michaela Blott, Gustavo Alonso
{"title":"AMNES:利用 FPGA 加速数据相关性计算","authors":"Monica Chiosa, Thomas B. Preußer, Michaela Blott, Gustavo Alonso","doi":"10.14778/3625054.3625056","DOIUrl":null,"url":null,"abstract":"A widely used approach to characterize input data in both databases and ML is computing the correlation between attributes. The operation is supported by all major database engines and ML platforms. However, it is an expensive operation as the number of attributes involved grows. To address the issue, in this paper we introduce AMNES, a stream analytics system offloading the correlation operator into an FPGA-based network interface card. AMNES processes data at network line rate and the design can be used in combination with smart storage or SmartNICs to implement near data or in-network data processing. AMNES design goes beyond matrix multiplication and offers a customized solution for correlation computation bypassing the CPU. Our experiments show that AMNES can sustain streams arriving at 100 Gbps over an RDMA network, while requiring only ten milliseconds to compute the correlation coefficients among 64 streams, an order of magnitude better than competing CPU or GPU designs.","PeriodicalId":20467,"journal":{"name":"Proc. VLDB Endow.","volume":"196 1","pages":"4174-4187"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"AMNES: Accelerating the computation of data correlation using FPGAs\",\"authors\":\"Monica Chiosa, Thomas B. Preußer, Michaela Blott, Gustavo Alonso\",\"doi\":\"10.14778/3625054.3625056\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A widely used approach to characterize input data in both databases and ML is computing the correlation between attributes. The operation is supported by all major database engines and ML platforms. However, it is an expensive operation as the number of attributes involved grows. To address the issue, in this paper we introduce AMNES, a stream analytics system offloading the correlation operator into an FPGA-based network interface card. AMNES processes data at network line rate and the design can be used in combination with smart storage or SmartNICs to implement near data or in-network data processing. AMNES design goes beyond matrix multiplication and offers a customized solution for correlation computation bypassing the CPU. Our experiments show that AMNES can sustain streams arriving at 100 Gbps over an RDMA network, while requiring only ten milliseconds to compute the correlation coefficients among 64 streams, an order of magnitude better than competing CPU or GPU designs.\",\"PeriodicalId\":20467,\"journal\":{\"name\":\"Proc. VLDB Endow.\",\"volume\":\"196 1\",\"pages\":\"4174-4187\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proc. VLDB Endow.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.14778/3625054.3625056\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proc. VLDB Endow.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.14778/3625054.3625056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
数据库和 ML 中广泛使用的一种表征输入数据的方法是计算属性之间的相关性。所有主要数据库引擎和 ML 平台都支持这种操作。然而,随着所涉及属性数量的增加,这一操作的成本也会随之增加。为了解决这个问题,我们在本文中介绍了 AMNES,这是一种流分析系统,可将相关运算器卸载到基于 FPGA 的网络接口卡中。AMNES 以网络线路速率处理数据,其设计可与智能存储或 SmartNIC 结合使用,以实现近距离数据或网络内数据处理。AMNES 的设计超越了矩阵乘法,为绕过 CPU 的相关计算提供了定制解决方案。我们的实验表明,AMNES 可以通过 RDMA 网络支持 100 Gbps 的数据流,而计算 64 个数据流之间的相关系数仅需 10 毫秒,比竞争对手的 CPU 或 GPU 设计高出一个数量级。
AMNES: Accelerating the computation of data correlation using FPGAs
A widely used approach to characterize input data in both databases and ML is computing the correlation between attributes. The operation is supported by all major database engines and ML platforms. However, it is an expensive operation as the number of attributes involved grows. To address the issue, in this paper we introduce AMNES, a stream analytics system offloading the correlation operator into an FPGA-based network interface card. AMNES processes data at network line rate and the design can be used in combination with smart storage or SmartNICs to implement near data or in-network data processing. AMNES design goes beyond matrix multiplication and offers a customized solution for correlation computation bypassing the CPU. Our experiments show that AMNES can sustain streams arriving at 100 Gbps over an RDMA network, while requiring only ten milliseconds to compute the correlation coefficients among 64 streams, an order of magnitude better than competing CPU or GPU designs.