The Integration of Hadoop and a Smart Utility Scoring with a Case-Based Reasoning System for Managing Large and Complex Medical Case Base

Seema Sharma, D. Mehrotra, Narjès Bellamine Ben Saoud
{"title":"The Integration of Hadoop and a Smart Utility Scoring with a Case-Based Reasoning System for Managing Large and Complex Medical Case Base","authors":"Seema Sharma, D. Mehrotra, Narjès Bellamine Ben Saoud","doi":"10.1142/s0219649222500812","DOIUrl":null,"url":null,"abstract":"In case-based reasoning structure, the quality and the complexity of the case data play a significant role in searching, retrieving, updating, and holding distinct case data at or from distinct databases for various purposes. However, managing the very complex and large case-data sets is not easy. Furthermore, the complex algorithm for maintaining case-based reasoning (CBR) structure makes it more critical and increases the time complexity considerably when the dataset is large and complex. Consequently, distinct available techniques related to the maintenance of the CBR system concentrate on deleting the less important and very complex case data to reduce the number of cases. At the same time, it reduces the efficiency and effectiveness of handling the CBR system. Hence, this research employs an integrated platform with the combination of the Hadoop parallel platform and an intelligent utility scoring mechanism. The smart indexing system and the parallel processing capacity of Hadoop reduce the time complexity for processing large and complex datasets. The distributed storage capability effectively manages the data repository and retrieval system for enormous case datasets without compromising case data. The intelligent utility scoring system employs supervised learning and a unique CBR system to quickly and efficiently retrieve the effective solution for each particular case. Furthermore, Hadoop offers a distributed structure that helps the user to access the case data and update the database through the network. This research uses distinct healthcare text-based datasets for testing the performances of the proposed integrated technique in different aspects. The experimental results show the superiorities in producing a higher percentage of accuracy with consuming less Retrieval time over other related techniques. It further shows its time efficiency by offering higher throughput and lower Cyclomatic Complexity with lower read and write Latency Times. Finally, the performances of the proposed technique in these various aspects have been compared with distinct existing techniques and the performances of various databases in managing large and complex medical data to establish its superiority over them.","PeriodicalId":127309,"journal":{"name":"J. Inf. Knowl. Manag.","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Inf. Knowl. Manag.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1142/s0219649222500812","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In case-based reasoning structure, the quality and the complexity of the case data play a significant role in searching, retrieving, updating, and holding distinct case data at or from distinct databases for various purposes. However, managing the very complex and large case-data sets is not easy. Furthermore, the complex algorithm for maintaining case-based reasoning (CBR) structure makes it more critical and increases the time complexity considerably when the dataset is large and complex. Consequently, distinct available techniques related to the maintenance of the CBR system concentrate on deleting the less important and very complex case data to reduce the number of cases. At the same time, it reduces the efficiency and effectiveness of handling the CBR system. Hence, this research employs an integrated platform with the combination of the Hadoop parallel platform and an intelligent utility scoring mechanism. The smart indexing system and the parallel processing capacity of Hadoop reduce the time complexity for processing large and complex datasets. The distributed storage capability effectively manages the data repository and retrieval system for enormous case datasets without compromising case data. The intelligent utility scoring system employs supervised learning and a unique CBR system to quickly and efficiently retrieve the effective solution for each particular case. Furthermore, Hadoop offers a distributed structure that helps the user to access the case data and update the database through the network. This research uses distinct healthcare text-based datasets for testing the performances of the proposed integrated technique in different aspects. The experimental results show the superiorities in producing a higher percentage of accuracy with consuming less Retrieval time over other related techniques. It further shows its time efficiency by offering higher throughput and lower Cyclomatic Complexity with lower read and write Latency Times. Finally, the performances of the proposed technique in these various aspects have been compared with distinct existing techniques and the performances of various databases in managing large and complex medical data to establish its superiority over them.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
集成Hadoop和基于案例推理的智能效用评分系统,用于管理大型复杂医疗案例库
在基于案例的推理结构中,案例数据的质量和复杂性在搜索、检索、更新和保存不同数据库中的不同案例数据方面起着重要作用。然而,管理非常复杂和庞大的病例数据集并不容易。此外,当数据集庞大且复杂时,维护基于案例推理(case-based reasoning, CBR)结构的复杂算法使其更加关键,并大大增加了时间复杂度。因此,与CBR系统维护相关的不同可用技术集中于删除不太重要和非常复杂的病例数据,以减少病例数量。同时,降低了CBR系统的处理效率和效果。因此,本研究采用Hadoop并行平台与智能效用评分机制相结合的集成平台。Hadoop的智能索引系统和并行处理能力降低了处理大型复杂数据集的时间复杂度。分布式存储功能有效地管理了大量案例数据集的数据存储库和检索系统,而不会损害案例数据。智能效用评分系统采用监督学习和独特的CBR系统,快速有效地检索每个特定案例的有效解决方案。此外,Hadoop提供了一个分布式结构,帮助用户通过网络访问案例数据和更新数据库。本研究使用不同的基于医疗文本的数据集来测试所提出的集成技术在不同方面的性能。实验结果表明,与其他相关技术相比,该方法在提高检索准确率和减少检索时间方面具有优势。通过提供更高的吞吐量和更低的圈复杂度以及更低的读写延迟时间,它进一步显示了它的时间效率。最后,将所提出的技术在这些方面的性能与不同的现有技术和各种数据库在管理大型复杂医疗数据方面的性能进行了比较,以确定其优于它们。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Knowledge Management in Higher Education in Vietnam: Insights from Higher Education Leaders - An Exploratory Study The Organisation's Size-Innovation Performance Relationship: The Role of Human Resource Development Mechanisms A Comparative Review of Sentimental Analysis Using Machine Learning and Deep Learning Approaches Vocational Education Information Technology Based on Cross-Attention Fusion Knowledge Map Recommendation Algorithm Redesigning Knowledge Management Through Corporate Sustainability Strategy in the Post-Pandemic Era
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1