Applications of rough sets in big data analysis: An overview

IF 1.6 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS International Journal of Applied Mathematics and Computer Science Pub Date : 2021-12-01 DOI:10.34768/amcs-2021-0046
P. Pięta, T. Szmuc
{"title":"Applications of rough sets in big data analysis: An overview","authors":"P. Pięta, T. Szmuc","doi":"10.34768/amcs-2021-0046","DOIUrl":null,"url":null,"abstract":"Abstract Big data, artificial intelligence and the Internet of things (IoT) are still very popular areas in current research and industrial applications. Processing massive amounts of data generated by the IoT and stored in distributed space is not a straightforward task and may cause many problems. During the last few decades, scientists have proposed many interesting approaches to extract information and discover knowledge from data collected in database systems or other sources. We observe a permanent development of machine learning algorithms that support each phase of the data mining process, ensuring achievement of better results than before. Rough set theory (RST) delivers a formal insight into information, knowledge, data reduction, uncertainty, and missing values. This formalism, formulated in the 1980s and developed by several researches, can serve as a theoretical basis and practical background for dealing with ambiguities, data reduction, building ontologies, etc. Moreover, as a mature theory, it has evolved into numerous extensions and has been transformed through various incarnations, which have enriched expressiveness and applicability of the related tools. The main aim of this article is to present an overview of selected applications of RST in big data analysis and processing. Thousands of publications on rough sets have been contributed; therefore, we focus on papers published in the last few years. The applications of RST are considered from two main perspectives: direct use of the RST concepts and tools, and jointly with other approaches, i.e., fuzzy sets, probabilistic concepts, and deep learning. The latter hybrid idea seems to be very promising for developing new methods and related tools as well as extensions of the application area.","PeriodicalId":50339,"journal":{"name":"International Journal of Applied Mathematics and Computer Science","volume":"30 1","pages":"659 - 683"},"PeriodicalIF":1.6000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Applied Mathematics and Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.34768/amcs-2021-0046","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 6

Abstract

Abstract Big data, artificial intelligence and the Internet of things (IoT) are still very popular areas in current research and industrial applications. Processing massive amounts of data generated by the IoT and stored in distributed space is not a straightforward task and may cause many problems. During the last few decades, scientists have proposed many interesting approaches to extract information and discover knowledge from data collected in database systems or other sources. We observe a permanent development of machine learning algorithms that support each phase of the data mining process, ensuring achievement of better results than before. Rough set theory (RST) delivers a formal insight into information, knowledge, data reduction, uncertainty, and missing values. This formalism, formulated in the 1980s and developed by several researches, can serve as a theoretical basis and practical background for dealing with ambiguities, data reduction, building ontologies, etc. Moreover, as a mature theory, it has evolved into numerous extensions and has been transformed through various incarnations, which have enriched expressiveness and applicability of the related tools. The main aim of this article is to present an overview of selected applications of RST in big data analysis and processing. Thousands of publications on rough sets have been contributed; therefore, we focus on papers published in the last few years. The applications of RST are considered from two main perspectives: direct use of the RST concepts and tools, and jointly with other approaches, i.e., fuzzy sets, probabilistic concepts, and deep learning. The latter hybrid idea seems to be very promising for developing new methods and related tools as well as extensions of the application area.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
粗糙集在大数据分析中的应用综述
大数据、人工智能和物联网(IoT)仍然是当前研究和工业应用中非常受欢迎的领域。处理由物联网生成并存储在分布式空间中的大量数据并不是一项简单的任务,可能会导致许多问题。在过去的几十年里,科学家们提出了许多有趣的方法来从数据库系统或其他来源收集的数据中提取信息和发现知识。我们观察到机器学习算法的永久发展,支持数据挖掘过程的每个阶段,确保取得比以前更好的结果。粗糙集理论(RST)提供了对信息、知识、数据约简、不确定性和缺失值的正式见解。这种形式主义形成于20世纪80年代,经过几项研究的发展,可以作为处理歧义、数据约简、构建本体等的理论基础和实践背景。而且,作为一种成熟的理论,它已经演变出了许多扩展,并通过各种化身进行了转化,这丰富了相关工具的表现力和适用性。本文的主要目的是概述RST在大数据分析和处理中的应用。已提供了数千份关于粗糙集的出版物;因此,我们关注的是最近几年发表的论文。RST的应用主要从两个方面考虑:直接使用RST概念和工具,以及与其他方法(即模糊集、概率概念和深度学习)联合使用。后一种混合思想对于开发新方法和相关工具以及扩展应用领域似乎非常有希望。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.10
自引率
21.10%
发文量
0
审稿时长
4.2 months
期刊介绍: The International Journal of Applied Mathematics and Computer Science is a quarterly published in Poland since 1991 by the University of Zielona Góra in partnership with De Gruyter Poland (Sciendo) and Lubuskie Scientific Society, under the auspices of the Committee on Automatic Control and Robotics of the Polish Academy of Sciences. The journal strives to meet the demand for the presentation of interdisciplinary research in various fields related to control theory, applied mathematics, scientific computing and computer science. In particular, it publishes high quality original research results in the following areas: -modern control theory and practice- artificial intelligence methods and their applications- applied mathematics and mathematical optimisation techniques- mathematical methods in engineering, computer science, and biology.
期刊最新文献
Improving Security Performance of Healthcare Data in the Internet of Medical Things using a Hybrid Metaheuristic Model Robust Flat Filtering Control of a Two Degrees of Freedom Helicopter Subject to Tail Rotor Disturbances Choice of the p-norm for High Level Classification Features Pruning in Modern Convolutional Neural Networks With Local Sensitivity Analysis Travelling Waves for Low–Grade Glioma Growth and Response to A Chemotherapy Model Asts: Autonomous Switching of Task–Level Strategies
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1