Relational into Non-Relational Database Migration with Multiple-Nested Schema Methods on Academic Data

T. B. Adji, D. C. R. Sari, N. A. Setiawan
{"title":"Relational into Non-Relational Database Migration with Multiple-Nested Schema Methods on Academic Data","authors":"T. B. Adji, D. C. R. Sari, N. A. Setiawan","doi":"10.22146/IJITEE.46503","DOIUrl":null,"url":null,"abstract":"The rapid development of internet technology has increased the need of data storage and processing technology application. One application is to manage academic data records at educational institutions. Along with massive growth of information, decrement in the traditional database performance is inevitable. Hence, there are many companies choose to migrate to NoSQL, a technology that is able to overcome the traditional database shortcomings. However, the existing SQL to NoSQL migration tools have not been able to represent SQL data relations in NoSQL without limiting query performance. In this paper, a relational database transformation system transforming MySQL into non-relational database MongoDB was developed, using the Multiple Nested Schema method for academic databases. The development began with a transformation scheme design. The transformation scheme was then implemented in the migration process, using PDI/Kettle. The testing was carried out on three aspects, namely query response time, data integrity, and storage requirements. The test results showed that the developed system successfully represented the relationship of SQL data in NoSQL, provided complex query performance 13.32 times faster in the migration database, basic query performance involving SQL transaction tables 28.6 times faster on migration results, and basic performance Queries without involving SQL transaction tables were 3.91 times faster in the migration source. This shows that the theory of the Multiple Nested Schema method, aiming to overcome the poor performance of queries involving many JOIN operations, is proved. In addition, the system is also proven to be able to maintain data integrity in all tested queries. The space performance test results indicated that the migrated database transformed using the Multiple Nested Schema method showed a storage requirement of 10.53 times larger than the migration source database. This is due to the large amount of data redundancy resulting from the transformation process. However, at present, storage performance is not a top priority in data processing technology, so large storage requirements are a consequence of obtaining efficient query performance, which is still considered as the first priority in data processing technology.","PeriodicalId":292390,"journal":{"name":"IJITEE (International Journal of Information Technology and Electrical Engineering)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IJITEE (International Journal of Information Technology and Electrical Engineering)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.22146/IJITEE.46503","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The rapid development of internet technology has increased the need of data storage and processing technology application. One application is to manage academic data records at educational institutions. Along with massive growth of information, decrement in the traditional database performance is inevitable. Hence, there are many companies choose to migrate to NoSQL, a technology that is able to overcome the traditional database shortcomings. However, the existing SQL to NoSQL migration tools have not been able to represent SQL data relations in NoSQL without limiting query performance. In this paper, a relational database transformation system transforming MySQL into non-relational database MongoDB was developed, using the Multiple Nested Schema method for academic databases. The development began with a transformation scheme design. The transformation scheme was then implemented in the migration process, using PDI/Kettle. The testing was carried out on three aspects, namely query response time, data integrity, and storage requirements. The test results showed that the developed system successfully represented the relationship of SQL data in NoSQL, provided complex query performance 13.32 times faster in the migration database, basic query performance involving SQL transaction tables 28.6 times faster on migration results, and basic performance Queries without involving SQL transaction tables were 3.91 times faster in the migration source. This shows that the theory of the Multiple Nested Schema method, aiming to overcome the poor performance of queries involving many JOIN operations, is proved. In addition, the system is also proven to be able to maintain data integrity in all tested queries. The space performance test results indicated that the migrated database transformed using the Multiple Nested Schema method showed a storage requirement of 10.53 times larger than the migration source database. This is due to the large amount of data redundancy resulting from the transformation process. However, at present, storage performance is not a top priority in data processing technology, so large storage requirements are a consequence of obtaining efficient query performance, which is still considered as the first priority in data processing technology.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于多嵌套模式方法的学术数据关系型数据库向非关系型数据库迁移
互联网技术的飞速发展,对数据存储和处理技术的应用提出了更高的要求。其中一个应用是管理教育机构的学术数据记录。随着信息的大量增长,传统数据库的性能下降是不可避免的。因此,有许多公司选择迁移到NoSQL,这种技术能够克服传统数据库的缺点。然而,现有的SQL到NoSQL的迁移工具还不能在不限制查询性能的情况下用NoSQL表示SQL数据关系。本文采用学术数据库的多重嵌套模式方法,开发了一个将MySQL转换为非关系数据库MongoDB的关系数据库转换系统。开发从改造方案设计开始。然后在迁移过程中使用PDI/Kettle实现转换方案。测试从三个方面进行,即查询响应时间、数据完整性和存储需求。测试结果表明,开发的系统成功地在NoSQL中表示SQL数据之间的关系,在迁移数据库中提供的复杂查询性能提高了13.32倍,在迁移结果中涉及SQL事务表的基本查询性能提高了28.6倍,在迁移源中不涉及SQL事务表的基本查询性能提高了3.91倍。这表明旨在克服涉及许多JOIN操作的查询性能差的Multiple Nested Schema方法的理论得到了证明。此外,该系统还被证明能够在所有测试查询中保持数据完整性。空间性能测试结果表明,使用多重嵌套模式方法转换的迁移数据库的存储需求比迁移源数据库大10.53倍。这是由于转换过程产生了大量的数据冗余。然而,在目前的数据处理技术中,存储性能并不是最优先考虑的,因此,获得高效的查询性能是对存储的巨大需求的结果,而查询性能仍然是数据处理技术的第一优先考虑。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Eye Blink Classification for Assisting Disability to Communicate Using Bagging and Boosting Product Recommendation Based on Eye Tracking Data Using Fixation Duration Optimal Capacity and Location Wind Turbine to Minimize Power Losses Using NSGA-II Factors Affecting Collaboration Portal Effectiveness of the Audit Board of Indonesia Piezoelectric Energy Harvester for IoT Sensor Devices
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1