Modified Data Storage and Replication Mechanism with Frequent Use-Case Based Indexing

Q3 Chemistry Journal of Computational and Theoretical Nanoscience Pub Date : 2020-12-01 DOI:10.1166/JCTN.2020.9413

P. Selvaraj, V. Kannan, Bruno Voisin

{"title":"Modified Data Storage and Replication Mechanism with Frequent Use-Case Based Indexing","authors":"P. Selvaraj, V. Kannan, Bruno Voisin","doi":"10.1166/JCTN.2020.9413","DOIUrl":null,"url":null,"abstract":"The real time applications demands high speed and reliable data access from the remote database. An effective logical data management strategy that handles simultaneous connections with better performance negotiation is inevitable. This work considers an e-health care application that\n proposes MongoDB based modified indexing and performance tuning methods. To cope with certain high frequency use case and its performance mandates, a flexible and efficient logical data management may be preferred. By analysing the data dependency, data decomposition concerns and the performance\n requirements of the specific use case of the medical application, a logical schema may be customized on an ala-carte basis. This work focused on the flexible logical data modeling schemes and its performance factors of the NoSql DB. The efficiency of unstructured data base management in storing\n and retrieving the e-health care data was analysed with a web based tool. To enable faster data retrieval and query processing over the distributed nodes, a Spark based storage engine was built on top of the MongoDB based data storage management. With Spark tool, the database has been made\n distributed as master–slave structures with suitable data replication mechanisms. In such distributed database the fail-over also implemented with the suitable replication mechanism. This work considered MongoDB based flexible schema modeling and Spark based distributed computation with\n multiple chunks of data. The flexible data modeling scheme with MongoDB with the on-demand Spark based computation framework was proposed. To facilitate the eventual consistency, scalability aspects of the e-health care applications, use case based indexing was proposed. With the effective\n data management, faster query processing the horizontal scalability has been increased. The overall efficiency and scalability of the proposed logical data management approach was analysed. Through the simulation studies, the proposed approach has been claimed to boost the performance of the\n bigdata based application to a considerable extent.","PeriodicalId":15416,"journal":{"name":"Journal of Computational and Theoretical Nanoscience","volume":"17 1","pages":"5229-5237"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational and Theoretical Nanoscience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1166/JCTN.2020.9413","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Chemistry","Score":null,"Total":0}

引用次数: 0

Abstract

The real time applications demands high speed and reliable data access from the remote database. An effective logical data management strategy that handles simultaneous connections with better performance negotiation is inevitable. This work considers an e-health care application that proposes MongoDB based modified indexing and performance tuning methods. To cope with certain high frequency use case and its performance mandates, a flexible and efficient logical data management may be preferred. By analysing the data dependency, data decomposition concerns and the performance requirements of the specific use case of the medical application, a logical schema may be customized on an ala-carte basis. This work focused on the flexible logical data modeling schemes and its performance factors of the NoSql DB. The efficiency of unstructured data base management in storing and retrieving the e-health care data was analysed with a web based tool. To enable faster data retrieval and query processing over the distributed nodes, a Spark based storage engine was built on top of the MongoDB based data storage management. With Spark tool, the database has been made distributed as master–slave structures with suitable data replication mechanisms. In such distributed database the fail-over also implemented with the suitable replication mechanism. This work considered MongoDB based flexible schema modeling and Spark based distributed computation with multiple chunks of data. The flexible data modeling scheme with MongoDB with the on-demand Spark based computation framework was proposed. To facilitate the eventual consistency, scalability aspects of the e-health care applications, use case based indexing was proposed. With the effective data management, faster query processing the horizontal scalability has been increased. The overall efficiency and scalability of the proposed logical data management approach was analysed. Through the simulation studies, the proposed approach has been claimed to boost the performance of the bigdata based application to a considerable extent.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于频繁用例索引的改进数据存储和复制机制

实时应用程序要求从远程数据库进行高速可靠的数据访问。一种有效的逻辑数据管理策略是不可避免的，它可以通过更好的性能协商来处理同时的连接。这项工作考虑了一个电子医疗保健应用程序，该应用程序提出了基于MongoDB的修改索引和性能调优方法。为了应对某些高频用例及其性能要求，可能首选灵活高效的逻辑数据管理。通过分析医疗应用程序特定用例的数据依赖性、数据分解问题和性能要求，可以在点菜的基础上定制逻辑模式。本文重点研究了NoSql数据库灵活的逻辑数据建模方案及其性能因素。使用基于web的工具分析了非结构化数据库管理在存储和检索电子医疗保健数据方面的效率。为了在分布式节点上实现更快的数据检索和查询处理，在基于MongoDB的数据存储管理之上构建了一个基于Spark的存储引擎。使用Spark工具，数据库以主从结构的形式分布，并具有适当的数据复制机制。在这样的分布式数据库中，故障转移也通过适当的复制机制来实现。这项工作考虑了基于MongoDB的灵活模式建模和基于Spark的多数据块分布式计算。提出了一种基于需求Spark计算框架的灵活的MongoDB数据建模方案。为了促进电子医疗应用程序的最终一致性和可扩展性，提出了基于用例的索引。有了有效的数据管理，查询处理速度更快，横向可扩展性也得到了提高。分析了所提出的逻辑数据管理方法的总体效率和可扩展性。通过仿真研究，该方法在很大程度上提高了基于大数据的应用程序的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊