Stargazer: A Deep Learning Approach for Estimating the Performance of Edge- Based Clustering Applications

2020 IEEE International Conference on Smart Data Services (SMDS) Pub Date : 2020-10-01 DOI:10.1109/SMDS49396.2020.00009

Breno Dantas Cruz, A. Paul, Z. Song, E. Tilevich

{"title":"Stargazer: A Deep Learning Approach for Estimating the Performance of Edge- Based Clustering Applications","authors":"Breno Dantas Cruz, A. Paul, Z. Song, E. Tilevich","doi":"10.1109/SMDS49396.2020.00009","DOIUrl":null,"url":null,"abstract":"As a solution to the sensor data deluge, edge computing processes sensor data by means of local devices. Many of these devices are resource-scarce in terms of the available processing capabilities and battery power. To achieve the required design trade-offs of edge applications, developers must be able to understand the performance and resource utilization of data processing algorithms. An increasing number of edge-based applications use machine learning (ML) as their key functionality. However, the performance and resource utilization of ML algorithms remain poorly understood, thus hindering the system design of edge-based ML applications. In addition, developers often cannot access real-world edge-based test beds during the design phase. To address this problem, we present an approach for estimating the performance of edge-based ML applications, with a particular application to clustering. To that end, we first comprehensively evaluate the performance and resource utilization of widely used clustering algorithms deployed in a representative edge environment. Second, we identify which properties of these algorithms are correlated with their performance and resource utilization. Finally, we apply our findings to create Stargazer, a Deep Neural Network that given a clustering algorithm's computational load and input data size, estimates how this algorithm would perform and utilize resources in an edge-based application. Our tool provides viable decision-making support for addressing the multifaceted design challenges of edge-based ML applications.","PeriodicalId":385149,"journal":{"name":"2020 IEEE International Conference on Smart Data Services (SMDS)","volume":"149 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Smart Data Services (SMDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMDS49396.2020.00009","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

As a solution to the sensor data deluge, edge computing processes sensor data by means of local devices. Many of these devices are resource-scarce in terms of the available processing capabilities and battery power. To achieve the required design trade-offs of edge applications, developers must be able to understand the performance and resource utilization of data processing algorithms. An increasing number of edge-based applications use machine learning (ML) as their key functionality. However, the performance and resource utilization of ML algorithms remain poorly understood, thus hindering the system design of edge-based ML applications. In addition, developers often cannot access real-world edge-based test beds during the design phase. To address this problem, we present an approach for estimating the performance of edge-based ML applications, with a particular application to clustering. To that end, we first comprehensively evaluate the performance and resource utilization of widely used clustering algorithms deployed in a representative edge environment. Second, we identify which properties of these algorithms are correlated with their performance and resource utilization. Finally, we apply our findings to create Stargazer, a Deep Neural Network that given a clustering algorithm's computational load and input data size, estimates how this algorithm would perform and utilize resources in an edge-based application. Our tool provides viable decision-making support for addressing the multifaceted design challenges of edge-based ML applications.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于边缘聚类应用的性能评估的深度学习方法

作为一种解决传感器数据泛滥的方法，边缘计算通过本地设备处理传感器数据。在可用的处理能力和电池电量方面，这些设备中的许多都是资源稀缺的。为了实现边缘应用程序所需的设计权衡，开发人员必须能够理解数据处理算法的性能和资源利用率。越来越多的基于边缘的应用程序使用机器学习(ML)作为其关键功能。然而，机器学习算法的性能和资源利用仍然知之甚少，从而阻碍了基于边缘的机器学习应用的系统设计。此外，在设计阶段，开发人员通常无法访问真实的基于边缘的测试平台。为了解决这个问题，我们提出了一种方法来估计基于边缘的机器学习应用程序的性能，并对集群进行了特定的应用。为此，我们首先全面评估了在代表性边缘环境中部署的广泛使用的聚类算法的性能和资源利用率。其次，我们确定这些算法的哪些属性与其性能和资源利用率相关。最后，我们将我们的发现应用于创建Stargazer，这是一个深度神经网络，给定聚类算法的计算负载和输入数据大小，估计该算法在基于边缘的应用程序中如何执行和利用资源。我们的工具为解决基于边缘的机器学习应用程序的多方面设计挑战提供了可行的决策支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 IEEE International Conference on Smart Data Services (SMDS)

自引率

0.00%

发文量