Modified Data Storage and Replication Mechanism with Frequent Use-Case Based Indexing

P. Selvaraj, V. Kannan, Bruno Voisin
Journal of Computational and Theoretical Nanoscience, vol. 17, pp. 5229-5237. Published 2020-12-01. DOI: 10.1166/JCTN.2020.9413 (https://doi.org/10.1166/JCTN.2020.9413)

Abstract

Real-time applications demand high-speed, reliable data access from remote databases. An effective logical data management strategy that handles simultaneous connections with better performance negotiation is therefore essential. This work considers an e-health care application and proposes MongoDB-based modified indexing and performance tuning methods for it. To cope with certain high-frequency use cases and their performance requirements, flexible and efficient logical data management is preferable. By analysing the data dependencies, data decomposition concerns, and performance requirements of a specific use case of the medical application, a logical schema can be customized on an à la carte basis. This work focuses on flexible logical data modeling schemes for a NoSQL database and on the factors that affect their performance. The efficiency of unstructured database management in storing and retrieving e-health care data was analysed with a web-based tool. To enable faster data retrieval and query processing over distributed nodes, a Spark-based storage engine was built on top of the MongoDB-based data storage management. With Spark, the database was distributed as master-slave structures with suitable data replication mechanisms. In this distributed database, fail-over is also implemented through a suitable replication mechanism. The work thus combines MongoDB-based flexible schema modeling with an on-demand, Spark-based distributed computation framework that operates on multiple chunks of data. To support the eventual consistency and scalability of e-health care applications, use-case-based indexing is proposed. With effective data management and faster query processing, horizontal scalability increased. The overall efficiency and scalability of the proposed logical data management approach were analysed. In simulation studies, the proposed approach is reported to improve the performance of the big-data-based application to a considerable extent.
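The abstract does not spell out how use-case-based indexing selects its indexes, so the following is only a minimal sketch of one plausible reading: count how often each field combination is queried and propose an index for the combinations that exceed a frequency threshold. The query log, the field names (patient_id, visit_date, doctor_id), the function name frequent_index_specs, and the min_share threshold are all assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter

# Hypothetical query log: each entry lists the fields one use case filters on.
QUERY_LOG = [
    ("patient_id",),
    ("patient_id", "visit_date"),
    ("patient_id",),
    ("doctor_id",),
    ("patient_id", "visit_date"),
    ("patient_id",),
]


def frequent_index_specs(query_log, min_share=0.3):
    """Return index specs for field combinations seen in at least min_share of queries."""
    counts = Counter(tuple(fields) for fields in query_log)
    total = sum(counts.values())
    specs = []
    # most_common() yields combinations from most to least frequent.
    for fields, count in counts.most_common():
        if count / total >= min_share:
            # (field, 1) is the ascending-order pair form used in MongoDB index specs.
            specs.append([(field, 1) for field in fields])
    return specs


specs = frequent_index_specs(QUERY_LOG)
for spec in specs:
    print(spec)
```

Each returned spec is a list of (field, direction) pairs, which is the shape that PyMongo's Collection.create_index accepts, so a caller with a live collection could pass the specs to it one by one. Because MongoDB can use a compound index for queries on a prefix of its fields, a refinement of this sketch might drop a single-field index that is already covered by a frequent compound one.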