{"title":"Proactive Data Allocation in Distributed Datasets based on an Ensemble Model","authors":"T. Koukaras, Kostas Kolomvatsos","doi":"10.1109/ICICS52457.2021.9464621","DOIUrl":null,"url":null,"abstract":"The evolution of the Internet of Things (IoT) drives the incorporation of numerous devices into a huge infrastructure where various services can be provided. Devices are located close to users being capable to interact with them and their environment to collect data. The collected data are transferred to the Cloud through the ‘intervention’ of the Edge Computing (EC) infrastructure. Multiple nodes are present at the EC that can undertake the responsibility of keeping some processing activities close to end users, thus, minimizing the latency in the provision of responses. In this paper, we elaborate on a model that supports an efficient data management mechanism to proactively decide the location where data should be stored. We aim at concluding a number of datasets exhibiting a high accuracy as exposed by their solidity. The proposed approach deals with the similarity of the collected data and the already formulated datasets before we decide the final allocation. Any decision is made upon the synopses of the discussed datasets avoiding the processing of huge volumes of data. Additionally, we elaborate on an ensemble scheme for matching the incoming observations and the available synopses. The performance of the proposed scheme is depicted by the relevant numerical outcomes.","PeriodicalId":421803,"journal":{"name":"2021 12th International Conference on Information and Communication Systems (ICICS)","volume":"97 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Information and Communication Systems (ICICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICS52457.2021.9464621","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The evolution of the Internet of Things (IoT) drives the incorporation of numerous devices into a huge infrastructure where various services can be provided. Devices are located close to users being capable to interact with them and their environment to collect data. The collected data are transferred to the Cloud through the ‘intervention’ of the Edge Computing (EC) infrastructure. Multiple nodes are present at the EC that can undertake the responsibility of keeping some processing activities close to end users, thus, minimizing the latency in the provision of responses. In this paper, we elaborate on a model that supports an efficient data management mechanism to proactively decide the location where data should be stored. We aim at concluding a number of datasets exhibiting a high accuracy as exposed by their solidity. The proposed approach deals with the similarity of the collected data and the already formulated datasets before we decide the final allocation. Any decision is made upon the synopses of the discussed datasets avoiding the processing of huge volumes of data. Additionally, we elaborate on an ensemble scheme for matching the incoming observations and the available synopses. The performance of the proposed scheme is depicted by the relevant numerical outcomes.