Applications of rough sets in big data analysis: An overview

IF 1.2 4区计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS International Journal of Applied Mathematics and Computer Science Pub Date : 2021-12-01 DOI:10.34768/amcs-2021-0046

P. Pięta, T. Szmuc

{"title":"Applications of rough sets in big data analysis: An overview","authors":"P. Pięta, T. Szmuc","doi":"10.34768/amcs-2021-0046","DOIUrl":null,"url":null,"abstract":"Abstract Big data, artificial intelligence and the Internet of things (IoT) are still very popular areas in current research and industrial applications. Processing massive amounts of data generated by the IoT and stored in distributed space is not a straightforward task and may cause many problems. During the last few decades, scientists have proposed many interesting approaches to extract information and discover knowledge from data collected in database systems or other sources. We observe a permanent development of machine learning algorithms that support each phase of the data mining process, ensuring achievement of better results than before. Rough set theory (RST) delivers a formal insight into information, knowledge, data reduction, uncertainty, and missing values. This formalism, formulated in the 1980s and developed by several researches, can serve as a theoretical basis and practical background for dealing with ambiguities, data reduction, building ontologies, etc. Moreover, as a mature theory, it has evolved into numerous extensions and has been transformed through various incarnations, which have enriched expressiveness and applicability of the related tools. The main aim of this article is to present an overview of selected applications of RST in big data analysis and processing. Thousands of publications on rough sets have been contributed; therefore, we focus on papers published in the last few years. The applications of RST are considered from two main perspectives: direct use of the RST concepts and tools, and jointly with other approaches, i.e., fuzzy sets, probabilistic concepts, and deep learning. The latter hybrid idea seems to be very promising for developing new methods and related tools as well as extensions of the application area.","PeriodicalId":50339,"journal":{"name":"International Journal of Applied Mathematics and Computer Science","volume":"30 1","pages":"659 - 683"},"PeriodicalIF":1.2000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Applied Mathematics and Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.34768/amcs-2021-0046","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}

引用次数: 6

Abstract

Abstract Big data, artificial intelligence and the Internet of things (IoT) are still very popular areas in current research and industrial applications. Processing massive amounts of data generated by the IoT and stored in distributed space is not a straightforward task and may cause many problems. During the last few decades, scientists have proposed many interesting approaches to extract information and discover knowledge from data collected in database systems or other sources. We observe a permanent development of machine learning algorithms that support each phase of the data mining process, ensuring achievement of better results than before. Rough set theory (RST) delivers a formal insight into information, knowledge, data reduction, uncertainty, and missing values. This formalism, formulated in the 1980s and developed by several researches, can serve as a theoretical basis and practical background for dealing with ambiguities, data reduction, building ontologies, etc. Moreover, as a mature theory, it has evolved into numerous extensions and has been transformed through various incarnations, which have enriched expressiveness and applicability of the related tools. The main aim of this article is to present an overview of selected applications of RST in big data analysis and processing. Thousands of publications on rough sets have been contributed; therefore, we focus on papers published in the last few years. The applications of RST are considered from two main perspectives: direct use of the RST concepts and tools, and jointly with other approaches, i.e., fuzzy sets, probabilistic concepts, and deep learning. The latter hybrid idea seems to be very promising for developing new methods and related tools as well as extensions of the application area.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

粗糙集在大数据分析中的应用综述

大数据、人工智能和物联网(IoT)仍然是当前研究和工业应用中非常受欢迎的领域。处理由物联网生成并存储在分布式空间中的大量数据并不是一项简单的任务，可能会导致许多问题。在过去的几十年里，科学家们提出了许多有趣的方法来从数据库系统或其他来源收集的数据中提取信息和发现知识。我们观察到机器学习算法的永久发展，支持数据挖掘过程的每个阶段，确保取得比以前更好的结果。粗糙集理论(RST)提供了对信息、知识、数据约简、不确定性和缺失值的正式见解。这种形式主义形成于20世纪80年代，经过几项研究的发展，可以作为处理歧义、数据约简、构建本体等的理论基础和实践背景。而且，作为一种成熟的理论，它已经演变出了许多扩展，并通过各种化身进行了转化，这丰富了相关工具的表现力和适用性。本文的主要目的是概述RST在大数据分析和处理中的应用。已提供了数千份关于粗糙集的出版物;因此，我们关注的是最近几年发表的论文。RST的应用主要从两个方面考虑:直接使用RST概念和工具，以及与其他方法(即模糊集、概率概念和深度学习)联合使用。后一种混合思想对于开发新方法和相关工具以及扩展应用领域似乎非常有希望。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Journal of Applied Mathematics and Computer Science 工程技术-计算机：人工智能

CiteScore

4.10

自引率

21.10%

发文量

审稿时长

4.2 months

期刊介绍： The International Journal of Applied Mathematics and Computer Science is a quarterly published in Poland since 1991 by the University of Zielona Góra in partnership with De Gruyter Poland (Sciendo) and Lubuskie Scientific Society, under the auspices of the Committee on Automatic Control and Robotics of the Polish Academy of Sciences. The journal strives to meet the demand for the presentation of interdisciplinary research in various fields related to control theory, applied mathematics, scientific computing and computer science. In particular, it publishes high quality original research results in the following areas: -modern control theory and practice- artificial intelligence methods and their applications- applied mathematics and mathematical optimisation techniques- mathematical methods in engineering, computer science, and biology.