Panagiotis Drakatos, Constantinos Costa, Andreas Konstantinidis, Panos K. Chrysanthis, Demetrios Zeinalipour-Yazti
{"title":"A blockchain datastore for scalable IoT workloads using data decaying","authors":"Panagiotis Drakatos, Constantinos Costa, Andreas Konstantinidis, Panos K. Chrysanthis, Demetrios Zeinalipour-Yazti","doi":"10.1007/s10619-024-07441-9","DOIUrl":null,"url":null,"abstract":"<p>The Internet of Things (IoT) revolution has introduced sensor-rich devices to an ever growing landscape of smart environments. A key component in the IoT scenarios of the future is the requirement to utilize a shared database that allows all participants to operate collaboratively, transparently, immutably, correctly and with performance guarantees. Blockchain databases have been proposed by the community to alleviate these challenges, however existing blockchain architectures suffer from performance issues. In this paper we introduce Triabase, a novel permissioned blockchain system architecture that applies data decaying concepts to cope with scalability issues in regards to blockchain consensus and storage efficiency. For blockchain consensus, we propose the Proof of Federated Learning (PoFL) algorithm which exploits data decaying models as Proof-of-Work. For storage efficiency, we exploit federated learning to construct data postdiction machine learning models to minimize the storage of bulky data on the blockchain. We present a detailed explanation of our system architecture as well as the implementation in the Hyperledger fabric framework. We use our implementation to carry out an experimental evaluation with telco big data at scale showing that our framework exposes desirable qualities, namely efficient consensus at the blockchain layer while optimizing storage efficiency.</p>","PeriodicalId":50568,"journal":{"name":"Distributed and Parallel Databases","volume":"156 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Distributed and Parallel Databases","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10619-024-07441-9","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The Internet of Things (IoT) revolution has introduced sensor-rich devices to an ever growing landscape of smart environments. A key component in the IoT scenarios of the future is the requirement to utilize a shared database that allows all participants to operate collaboratively, transparently, immutably, correctly and with performance guarantees. Blockchain databases have been proposed by the community to alleviate these challenges, however existing blockchain architectures suffer from performance issues. In this paper we introduce Triabase, a novel permissioned blockchain system architecture that applies data decaying concepts to cope with scalability issues in regards to blockchain consensus and storage efficiency. For blockchain consensus, we propose the Proof of Federated Learning (PoFL) algorithm which exploits data decaying models as Proof-of-Work. For storage efficiency, we exploit federated learning to construct data postdiction machine learning models to minimize the storage of bulky data on the blockchain. We present a detailed explanation of our system architecture as well as the implementation in the Hyperledger fabric framework. We use our implementation to carry out an experimental evaluation with telco big data at scale showing that our framework exposes desirable qualities, namely efficient consensus at the blockchain layer while optimizing storage efficiency.
期刊介绍:
Distributed and Parallel Databases publishes papers in all the traditional as well as most emerging areas of database research, including:
Availability and reliability;
Benchmarking and performance evaluation, and tuning;
Big Data Storage and Processing;
Cloud Computing and Database-as-a-Service;
Crowdsourcing;
Data curation, annotation and provenance;
Data integration, metadata Management, and interoperability;
Data models, semantics, query languages;
Data mining and knowledge discovery;
Data privacy, security, trust;
Data provenance, workflows, Scientific Data Management;
Data visualization and interactive data exploration;
Data warehousing, OLAP, Analytics;
Graph data management, RDF, social networks;
Information Extraction and Data Cleaning;
Middleware and Workflow Management;
Modern Hardware and In-Memory Database Systems;
Query Processing and Optimization;
Semantic Web and open data;
Social Networks;
Storage, indexing, and physical database design;
Streams, sensor networks, and complex event processing;
Strings, Texts, and Keyword Search;
Spatial, temporal, and spatio-temporal databases;
Transaction processing;
Uncertain, probabilistic, and approximate databases.