A Concept of an In-Memory Database for IoT Sensor Data

Marina Burdack, M. Rössle, René Kübler
{"title":"A Concept of an In-Memory Database for IoT Sensor Data","authors":"Marina Burdack, M. Rössle, René Kübler","doi":"10.30958/AJS.5-4-4","DOIUrl":null,"url":null,"abstract":"In the context of digital transformation and use of Industry 4.0 technology in companies, machines and other objects are increasingly being equipped with sensors. Normally, these machines are monitored 24/7, so that data streams are continuously generated by sensors. These data has to be stored in a database. In order to facilitate a fast data mining process and the use of machine learning algorithms, a performant and robust data store for the vast amount of sensor data is necessary. These raw time series sensor data has typical structures that are difficult to model with traditional database management systems. Here, column-oriented In-Memory databases like SAP HANA or Gorilla are better suited. However, SAP HANA have not been developed to store relational data, so that it contains components like transaction and concurrency control, which are unnecessary for the named range of application, because machine learning algorithms only need reading access. By reducing this concept to the essentials, a specialized, lightweight In-Memory database management system can be developed, which perfectly fits to the characteristics of time series sensor data. For that concept the benefits of the In-Memory data structure of SAP HANA and Facebook Gorilla are merged and combined with additional meta information like limits for minimum and maximum warning for each sensor, special user specified column fields or rules for sampling and replenishment values. The evaluation of the implemented prototype shows on the one hand that the time series sensor data can be stored efficiently using a new table structure and an intelligent combination of the ZFP compression method with a block orientated data structure, which results in a good insert performance. On the other hand, this storage logic leads to an efficient data access of the compressed in-memory data structure, thus every reporting or analyzing tasks access the data efficiently and fast.","PeriodicalId":91843,"journal":{"name":"Athens journal of sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2018-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Athens journal of sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.30958/AJS.5-4-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

In the context of digital transformation and use of Industry 4.0 technology in companies, machines and other objects are increasingly being equipped with sensors. Normally, these machines are monitored 24/7, so that data streams are continuously generated by sensors. These data has to be stored in a database. In order to facilitate a fast data mining process and the use of machine learning algorithms, a performant and robust data store for the vast amount of sensor data is necessary. These raw time series sensor data has typical structures that are difficult to model with traditional database management systems. Here, column-oriented In-Memory databases like SAP HANA or Gorilla are better suited. However, SAP HANA have not been developed to store relational data, so that it contains components like transaction and concurrency control, which are unnecessary for the named range of application, because machine learning algorithms only need reading access. By reducing this concept to the essentials, a specialized, lightweight In-Memory database management system can be developed, which perfectly fits to the characteristics of time series sensor data. For that concept the benefits of the In-Memory data structure of SAP HANA and Facebook Gorilla are merged and combined with additional meta information like limits for minimum and maximum warning for each sensor, special user specified column fields or rules for sampling and replenishment values. The evaluation of the implemented prototype shows on the one hand that the time series sensor data can be stored efficiently using a new table structure and an intelligent combination of the ZFP compression method with a block orientated data structure, which results in a good insert performance. On the other hand, this storage logic leads to an efficient data access of the compressed in-memory data structure, thus every reporting or analyzing tasks access the data efficiently and fast.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
物联网传感器数据的内存数据库概念
在数字化转型和企业使用工业4.0技术的背景下,机器和其他物体越来越多地配备了传感器。通常,这些机器是全天候监控的,因此传感器会连续生成数据流。这些数据必须存储在数据库中。为了促进快速的数据挖掘过程和机器学习算法的使用,有必要为大量的传感器数据提供一个高性能和稳健的数据存储。这些原始时间序列传感器数据具有传统数据库管理系统难以建模的典型结构。在这里,像SAP HANA或Gorilla这样的面向列的In-Memory数据库更适合。然而,SAP HANA尚未被开发用于存储关系数据,因此它包含事务和并发控制等组件,这些组件对于命名的应用程序范围来说是不必要的,因为机器学习算法只需要读取访问权限。通过将这一概念简化为基本概念,可以开发出一个专门的、轻量级的内存数据库管理系统,该系统完全符合时间序列传感器数据的特点。对于这一概念,SAP HANA和Facebook Gorilla的内存中数据结构的优势与额外的元信息相结合,如每个传感器的最小和最大警告限制、用户指定的特殊列字段或采样和补充值规则。对实现的原型的评估表明,一方面,使用新的表结构和ZFP压缩方法与面向块的数据结构的智能组合,可以有效地存储时间序列传感器数据,这导致了良好的插入性能。另一方面,这种存储逻辑导致了对压缩的内存中数据结构的有效数据访问,因此每个报告或分析任务都能有效而快速地访问数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Growth of Crayfish, which Serves as an Indicator of Clean and Healthy Water Ecosystems in the Mediterranean Region Identifying Sustainability Efforts in Company’s Reports Using Text Mining and Machine Learning Generative Urban Design in the Field of Infrastructure: An Optimizing Solution for Connecting Fier and Vlora County by a 600 m Bridge over Selenica River, Albania Infrastructures of Large-Scale Geothermal Energy Projects in Kenya: Materialization, Generativity, and Socio-Economic Development Linkages An Analysis of Stream Flow and Flood Frequency: A Case Study from Downstream of Kelani River Basin, Sri Lanka
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1