In order to engineer new materials, structures, systems, and processes that address persistent challenges, engineers seek to tie causes to effects and understand the effects of causes. Such a pursuit requires a causal investigation to uncover the underlying structure of the data generating process (DGP) governing phenomena. A causal approach derives causal models that engineers can adopt to infer the effects of interventions (and explore possible counterfactuals). Yet, and for the most part, we continue to design experiments in the hope of empirically observing engineered intervention(s). Such experiments are idealized, complex, and costly and hence are narrow in scope. On the contrary, a causal investigation will allow us to peek into the how and why of a DGP and provide us with the essential means to articulate a causal model that accurately describes the phenomenon on hand and better predicts the outcome of possible interventions. Adopting a causal approach in engineering is perhaps more warranted than ever—especially with the rise of big data and the adoption of artificial intelligence (AI); wherein AI models are naivety presumed to describe causal ties. To bridge such knowledge gap, this primer presents fundamental principles behind causal discovery, causal inference, and counterfactuals from an engineering perspective and contrasts that to those pertaining to correlation, regression, and AI.This article is categorized under: Application Areas > Industry Specific Applications Algorithmic Development > Causality Discovery Application Areas > Science and Technology Technologies > Machine Learning
{"title":"Causality and causal inference for engineers: Beyond correlation, regression, prediction and artificial intelligence","authors":"M. Naser","doi":"10.1002/widm.1533","DOIUrl":"https://doi.org/10.1002/widm.1533","url":null,"abstract":"In order to engineer new materials, structures, systems, and processes that address persistent challenges, engineers seek to tie causes to effects and understand the effects of causes. Such a pursuit requires a causal investigation to uncover the underlying structure of the data generating process (DGP) governing phenomena. A causal approach derives causal models that engineers can adopt to infer the effects of interventions (and explore possible counterfactuals). Yet, and for the most part, we continue to design experiments in the hope of empirically observing engineered intervention(s). Such experiments are idealized, complex, and costly and hence are narrow in scope. On the contrary, a causal investigation will allow us to peek into the how and why of a DGP and provide us with the essential means to articulate a causal model that accurately describes the phenomenon on hand and better predicts the outcome of possible interventions. Adopting a causal approach in engineering is perhaps more warranted than ever—especially with the rise of big data and the adoption of artificial intelligence (AI); wherein AI models are naivety presumed to describe causal ties. To bridge such knowledge gap, this primer presents fundamental principles behind causal discovery, causal inference, and counterfactuals from an engineering perspective and contrasts that to those pertaining to correlation, regression, and AI.This article is categorized under:\u0000Application Areas > Industry Specific Applications\u0000Algorithmic Development > Causality Discovery\u0000Application Areas > Science and Technology\u0000Technologies > Machine Learning\u0000","PeriodicalId":500599,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"255 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140255719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mohamed Abd Elaziz, Mohammed A. A. Al‐qaness, Abdelghani Dahou, Saeed Hamood Alsamhi, Laith Abualigah, Rehab Ali Ibrahim, Ahmed A. Ewees
Abstract The sixth generation (6G) represents the next evolution in wireless communication technology and is currently under research and development. It is expected to deliver faster speeds, reduced latency, and greater capacity compared to the current 5G wireless technology. 6G is envisioned as a technology capable of establishing a fully data‐driven network, proficient in analyzing and optimizing end‐to‐end behavior and handling massive volumes of real‐time data at rates of up to terabits per second (Tb/s). Moreover, 6G is designed to accommodate an average of 1000+ substantial connections per person over the course of a decade. The concept of a data‐driven network introduces a new service paradigm, which offers fresh opportunities for applications within 6G wireless communication and network design in the future. This paper aims to provide a survey of existing applications of 6G that are based on deep learning techniques. It also explores the potential, essential technologies, scenarios, challenges, and related topics associated with 6G. These aspects are crucial for meeting the requirements for the development of future intelligent networks. Furthermore, this work delves into various research gaps between deep learning and 6G that remain unexplored. Different potential deep learning applications for 6G networks, including privacy, security, environmentally friendly communication, sustainability, and various wireless applications, are discussed. Additionally, we shed light on the challenges and future trends in this field. This article is categorized under: Technologies > Computational Intelligence Fundamental Concepts of Data and Knowledge > Explainable AI Technologies > Machine Learning
{"title":"Evolution toward intelligent communications: Impact of deep learning applications on the future of <scp>6G</scp> technology","authors":"Mohamed Abd Elaziz, Mohammed A. A. Al‐qaness, Abdelghani Dahou, Saeed Hamood Alsamhi, Laith Abualigah, Rehab Ali Ibrahim, Ahmed A. Ewees","doi":"10.1002/widm.1521","DOIUrl":"https://doi.org/10.1002/widm.1521","url":null,"abstract":"Abstract The sixth generation (6G) represents the next evolution in wireless communication technology and is currently under research and development. It is expected to deliver faster speeds, reduced latency, and greater capacity compared to the current 5G wireless technology. 6G is envisioned as a technology capable of establishing a fully data‐driven network, proficient in analyzing and optimizing end‐to‐end behavior and handling massive volumes of real‐time data at rates of up to terabits per second (Tb/s). Moreover, 6G is designed to accommodate an average of 1000+ substantial connections per person over the course of a decade. The concept of a data‐driven network introduces a new service paradigm, which offers fresh opportunities for applications within 6G wireless communication and network design in the future. This paper aims to provide a survey of existing applications of 6G that are based on deep learning techniques. It also explores the potential, essential technologies, scenarios, challenges, and related topics associated with 6G. These aspects are crucial for meeting the requirements for the development of future intelligent networks. Furthermore, this work delves into various research gaps between deep learning and 6G that remain unexplored. Different potential deep learning applications for 6G networks, including privacy, security, environmentally friendly communication, sustainability, and various wireless applications, are discussed. Additionally, we shed light on the challenges and future trends in this field. This article is categorized under: Technologies > Computational Intelligence Fundamental Concepts of Data and Knowledge > Explainable AI Technologies > Machine Learning","PeriodicalId":500599,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"87 10","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135539395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Wai Sze YIP, Hengzhou Edward Yan, Baolong Zhang, Suet To
Abstract Ultra‐precision machining (UPM), one of the most advanced machining techniques that can produce exact components, significantly impacts the technological community. The significance of UPM attracts the attention of academic and industrial partners. As a result of the rapid development of UPM caused by technological advancement, it is necessary to revisit the current stages and evolution of UPM to sustain and advance this technology. The state of the art in UPM is first investigated systematically in this study by identifying the current four major UPM themes. The UPM thematic network is then built, along with a structural analysis of the network, to determine the interactions between each theme and the primary roles of theme members responsible for the interactions. Furthermore, the “bridge” role is assigned to the specific UPM theme content. On the other hand, Sentiment analysis is conducted to determine how the academic community at UPM feels about the themes for UPM research to focus on those themes with a need for more confidence. Considering the above findings, the future perspective of UPM and suggestions for its advancement are discussed and provided. This study provides a comprehensive understanding and the current state‐of‐the‐art review of UPM technology by a text mining technique to critically analyze its research content, as well as suggestions to enhance UPM development by focusing on its current challenges, thereby assisting academia and institutions in leveraging this technology to benefit society. This article is categorized under: Algorithmic Development > Text Mining Application Areas > Science and Technology Application Areas > Industry Specific Applications
{"title":"The state‐of‐art review of ultra‐precision machining using text mining: Identification of main themes and recommendations for the future direction","authors":"Wai Sze YIP, Hengzhou Edward Yan, Baolong Zhang, Suet To","doi":"10.1002/widm.1517","DOIUrl":"https://doi.org/10.1002/widm.1517","url":null,"abstract":"Abstract Ultra‐precision machining (UPM), one of the most advanced machining techniques that can produce exact components, significantly impacts the technological community. The significance of UPM attracts the attention of academic and industrial partners. As a result of the rapid development of UPM caused by technological advancement, it is necessary to revisit the current stages and evolution of UPM to sustain and advance this technology. The state of the art in UPM is first investigated systematically in this study by identifying the current four major UPM themes. The UPM thematic network is then built, along with a structural analysis of the network, to determine the interactions between each theme and the primary roles of theme members responsible for the interactions. Furthermore, the “bridge” role is assigned to the specific UPM theme content. On the other hand, Sentiment analysis is conducted to determine how the academic community at UPM feels about the themes for UPM research to focus on those themes with a need for more confidence. Considering the above findings, the future perspective of UPM and suggestions for its advancement are discussed and provided. This study provides a comprehensive understanding and the current state‐of‐the‐art review of UPM technology by a text mining technique to critically analyze its research content, as well as suggestions to enhance UPM development by focusing on its current challenges, thereby assisting academia and institutions in leveraging this technology to benefit society. This article is categorized under: Algorithmic Development > Text Mining Application Areas > Science and Technology Application Areas > Industry Specific Applications","PeriodicalId":500599,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135758509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Cheng Zhang, Nilam Nur Amir Sjarif, Roslina Ibrahim
Abstract Accurately predicting the prices of financial time series is essential and challenging for the financial sector. Owing to recent advancements in deep learning techniques, deep learning models are gradually replacing traditional statistical and machine learning models as the first choice for price forecasting tasks. This shift in model selection has led to a notable rise in research related to applying deep learning models to price forecasting, resulting in a rapid accumulation of new knowledge. Therefore, we conducted a literature review of relevant studies over the past 3 years with a view to aiding researchers and practitioners in the field. This review delves deeply into deep learning‐based forecasting models, presenting information on model architectures, practical applications, and their respective advantages and disadvantages. In particular, detailed information is provided on advanced models for price forecasting, such as Transformers, generative adversarial networks (GANs), graph neural networks (GNNs), and deep quantum neural networks (DQNNs). The present contribution also includes potential directions for future research, such as examining the effectiveness of deep learning models with complex structures for price forecasting, extending from point prediction to interval prediction using deep learning models, scrutinizing the reliability and validity of decomposition ensembles, and exploring the influence of data volume on model performance. This article is categorized under: Technologies > Prediction Technologies > Artificial Intelligence
{"title":"Deep learning models for price forecasting of financial time series: A review of recent advancements: 2020–2022","authors":"Cheng Zhang, Nilam Nur Amir Sjarif, Roslina Ibrahim","doi":"10.1002/widm.1519","DOIUrl":"https://doi.org/10.1002/widm.1519","url":null,"abstract":"Abstract Accurately predicting the prices of financial time series is essential and challenging for the financial sector. Owing to recent advancements in deep learning techniques, deep learning models are gradually replacing traditional statistical and machine learning models as the first choice for price forecasting tasks. This shift in model selection has led to a notable rise in research related to applying deep learning models to price forecasting, resulting in a rapid accumulation of new knowledge. Therefore, we conducted a literature review of relevant studies over the past 3 years with a view to aiding researchers and practitioners in the field. This review delves deeply into deep learning‐based forecasting models, presenting information on model architectures, practical applications, and their respective advantages and disadvantages. In particular, detailed information is provided on advanced models for price forecasting, such as Transformers, generative adversarial networks (GANs), graph neural networks (GNNs), and deep quantum neural networks (DQNNs). The present contribution also includes potential directions for future research, such as examining the effectiveness of deep learning models with complex structures for price forecasting, extending from point prediction to interval prediction using deep learning models, scrutinizing the reliability and validity of decomposition ensembles, and exploring the influence of data volume on model performance. This article is categorized under: Technologies > Prediction Technologies > Artificial Intelligence","PeriodicalId":500599,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135344957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abstract Large language models (LLMs) have substantially pushed artificial intelligence (AI) research and applications in the last few years. They are currently able to achieve high effectiveness in different natural language processing (NLP) tasks, such as machine translation, named entity recognition, text classification, question answering, or text summarization. Recently, significant attention has been drawn to OpenAI's GPT models' capabilities and extremely accessible interface. LLMs are nowadays routinely used and studied for downstream tasks and specific applications with great success, pushing forward the state of the art in almost all of them. However, they also exhibit impressive inference capabilities when used off the shelf without further training. In this paper, we aim to study the behavior of pre‐trained language models (PLMs) in some inference tasks they were not initially trained for. Therefore, we focus our attention on very recent research works related to the inference capabilities of PLMs in some selected tasks such as factual probing and common‐sense reasoning. We highlight relevant achievements made by these models, as well as some of their current limitations that open opportunities for further research. This article is categorized under: Fundamental Concepts of Data and Knowledge > Key Design Issues in Data Mining Technologies > Artificial Intelligence
{"title":"<scp>Pre‐trained</scp> language models: What do they know?","authors":"Nuno Guimarães, Ricardo Campos, Alípio Jorge","doi":"10.1002/widm.1518","DOIUrl":"https://doi.org/10.1002/widm.1518","url":null,"abstract":"Abstract Large language models (LLMs) have substantially pushed artificial intelligence (AI) research and applications in the last few years. They are currently able to achieve high effectiveness in different natural language processing (NLP) tasks, such as machine translation, named entity recognition, text classification, question answering, or text summarization. Recently, significant attention has been drawn to OpenAI's GPT models' capabilities and extremely accessible interface. LLMs are nowadays routinely used and studied for downstream tasks and specific applications with great success, pushing forward the state of the art in almost all of them. However, they also exhibit impressive inference capabilities when used off the shelf without further training. In this paper, we aim to study the behavior of pre‐trained language models (PLMs) in some inference tasks they were not initially trained for. Therefore, we focus our attention on very recent research works related to the inference capabilities of PLMs in some selected tasks such as factual probing and common‐sense reasoning. We highlight relevant achievements made by these models, as well as some of their current limitations that open opportunities for further research. This article is categorized under: Fundamental Concepts of Data and Knowledge > Key Design Issues in Data Mining Technologies > Artificial Intelligence","PeriodicalId":500599,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136152980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jameel Ahmad, Muhammad Umer Zia, Ijaz Haider Naqvi, Jawwad Nasar Chattha, Faran Awais Butt, Tao Huang, Wei Xiang
Abstract Future connected and autonomous vehicles (CAVs) must be secured against cyberattacks for their everyday functions on the road so that safety of passengers and vehicles can be ensured. This article presents a holistic review of cybersecurity attacks on sensors and threats regarding multi‐modal sensor fusion. A comprehensive review of cyberattacks on intra‐vehicle and inter‐vehicle communications is presented afterward. Besides the analysis of conventional cybersecurity threats and countermeasures for CAV systems, a detailed review of modern machine learning, federated learning, and blockchain approach is also conducted to safeguard CAVs. Machine learning and data mining‐aided intrusion detection systems and other countermeasures dealing with these challenges are elaborated at the end of the related section. In the last section, research challenges and future directions are identified. This article is categorized under: Commercial, Legal, and Ethical Issues > Security and Privacy Technologies > Machine Learning Technologies > Internet of Things
{"title":"Machine learning and blockchain technologies for cybersecurity in connected vehicles","authors":"Jameel Ahmad, Muhammad Umer Zia, Ijaz Haider Naqvi, Jawwad Nasar Chattha, Faran Awais Butt, Tao Huang, Wei Xiang","doi":"10.1002/widm.1515","DOIUrl":"https://doi.org/10.1002/widm.1515","url":null,"abstract":"Abstract Future connected and autonomous vehicles (CAVs) must be secured against cyberattacks for their everyday functions on the road so that safety of passengers and vehicles can be ensured. This article presents a holistic review of cybersecurity attacks on sensors and threats regarding multi‐modal sensor fusion. A comprehensive review of cyberattacks on intra‐vehicle and inter‐vehicle communications is presented afterward. Besides the analysis of conventional cybersecurity threats and countermeasures for CAV systems, a detailed review of modern machine learning, federated learning, and blockchain approach is also conducted to safeguard CAVs. Machine learning and data mining‐aided intrusion detection systems and other countermeasures dealing with these challenges are elaborated at the end of the related section. In the last section, research challenges and future directions are identified. This article is categorized under: Commercial, Legal, and Ethical Issues > Security and Privacy Technologies > Machine Learning Technologies > Internet of Things","PeriodicalId":500599,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135107664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Sepehr Ghazinoory, Jinus Roshandel, Fatemeh Parvin, Shohreh Nasri, Mehdi Fatemi
Smart cities are one of the consequences of digital transformation, and there have been many attempts to assess the smartness of cities with various frameworks. Among these frameworks, smart city maturity models (SCMMs) evaluate the existing conditions of cities and provide guidelines for progressing through the subsequent stages of maturity. However, most maturity models follow the instructions of the first model, published by the International Data Corporation, and there are many similarities across the models. These maturity models have advantages and disadvantages, while previous studies have not addressed the differences. Therefore, this article fills this knowledge gap by systematically reviewing the existing SCMMs. The findings suggest that some trending topics, such as resiliency concerning global pandemics and cultural aspects are neglected in SCMMs. Moreover, the validation techniques of the models are not rational. Finally, given the theoretical nature of most models, they cannot be applied to multiple regions.This article is categorized under: Fundamental Concepts of Data and Knowledge > Big Data Mining Technologies > Artificial Intelligence Technologies > Machine Learning
智慧城市是数字化转型的结果之一,人们已经尝试用各种框架来评估城市的智慧。在这些框架中,智慧城市成熟度模型(scmm)评估城市的现有条件,并为随后的成熟阶段提供指导。然而,大多数成熟度模型都遵循由International Data Corporation发布的第一个模型的说明,并且这些模型之间有许多相似之处。这些成熟度模型各有优缺点,而以往的研究并没有解决这些差异。因此,本文通过系统地回顾现有的scm来填补这一知识空白。研究结果表明,scmm忽略了一些趋势主题,例如与全球流行病和文化方面有关的弹性。此外,模型的验证技术也不合理。最后,考虑到大多数模型的理论性质,它们不能适用于多个地区。本文分类如下:数据和知识的基本概念>大数据挖掘技术;人工智能技术;机器学习
{"title":"Smart city maturity models: A multidimensional synthesized approach","authors":"Sepehr Ghazinoory, Jinus Roshandel, Fatemeh Parvin, Shohreh Nasri, Mehdi Fatemi","doi":"10.1002/widm.1516","DOIUrl":"https://doi.org/10.1002/widm.1516","url":null,"abstract":"Smart cities are one of the consequences of digital transformation, and there have been many attempts to assess the smartness of cities with various frameworks. Among these frameworks, smart city maturity models (SCMMs) evaluate the existing conditions of cities and provide guidelines for progressing through the subsequent stages of maturity. However, most maturity models follow the instructions of the first model, published by the International Data Corporation, and there are many similarities across the models. These maturity models have advantages and disadvantages, while previous studies have not addressed the differences. Therefore, this article fills this knowledge gap by systematically reviewing the existing SCMMs. The findings suggest that some trending topics, such as resiliency concerning global pandemics and cultural aspects are neglected in SCMMs. Moreover, the validation techniques of the models are not rational. Finally, given the theoretical nature of most models, they cannot be applied to multiple regions.This article is categorized under:\u0000Fundamental Concepts of Data and Knowledge > Big Data Mining\u0000Technologies > Artificial Intelligence\u0000Technologies > Machine Learning","PeriodicalId":500599,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135437834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qazi Mohammad Areeb, Mohammad Nadeem, Shahab Saquib Sohail, Raza Imam, Faiyaz Doctor, Yassine Himeur, Amir Hussain, Abbes Amira
Abstract A filter bubble refers to the phenomenon where Internet customization effectively isolates individuals from diverse opinions or materials, resulting in their exposure to only a select set of content. This can lead to the reinforcement of existing attitudes, beliefs, or conditions. In this study, our primary focus is to investigate the impact of filter bubbles in recommender systems (RSs). This pioneering research aims to uncover the reasons behind this problem, explore potential solutions, and propose an integrated tool to help users avoid filter bubbles in RSs. To achieve this objective, we conduct a systematic literature review on the topic of filter bubbles in RSs. The reviewed articles are carefully analyzed and classified, providing valuable insights that inform the development of an integrated approach. Notably, our review reveals evidence of filter bubbles in RSs, highlighting several biases that contribute to their existence. Moreover, we propose mechanisms to mitigate the impact of filter bubbles and demonstrate that incorporating diversity into recommendations can potentially help alleviate this issue. The findings of this timely review will serve as a benchmark for researchers working in interdisciplinary fields such as privacy, artificial intelligence ethics, and RSs. Furthermore, it will open new avenues for future research in related domains, prompting further exploration and advancement in this critical area. This article is categorized under: Fundamental Concepts of Data and Knowledge > Human Centricity and User Interaction Application Areas > Internet Commercial, Legal, and Ethical Issues > Ethical Considerations Commercial, Legal, and Ethical Issues > Security and Privacy
{"title":"Filter bubbles in recommender systems: Fact or fallacy—A systematic review","authors":"Qazi Mohammad Areeb, Mohammad Nadeem, Shahab Saquib Sohail, Raza Imam, Faiyaz Doctor, Yassine Himeur, Amir Hussain, Abbes Amira","doi":"10.1002/widm.1512","DOIUrl":"https://doi.org/10.1002/widm.1512","url":null,"abstract":"Abstract A filter bubble refers to the phenomenon where Internet customization effectively isolates individuals from diverse opinions or materials, resulting in their exposure to only a select set of content. This can lead to the reinforcement of existing attitudes, beliefs, or conditions. In this study, our primary focus is to investigate the impact of filter bubbles in recommender systems (RSs). This pioneering research aims to uncover the reasons behind this problem, explore potential solutions, and propose an integrated tool to help users avoid filter bubbles in RSs. To achieve this objective, we conduct a systematic literature review on the topic of filter bubbles in RSs. The reviewed articles are carefully analyzed and classified, providing valuable insights that inform the development of an integrated approach. Notably, our review reveals evidence of filter bubbles in RSs, highlighting several biases that contribute to their existence. Moreover, we propose mechanisms to mitigate the impact of filter bubbles and demonstrate that incorporating diversity into recommendations can potentially help alleviate this issue. The findings of this timely review will serve as a benchmark for researchers working in interdisciplinary fields such as privacy, artificial intelligence ethics, and RSs. Furthermore, it will open new avenues for future research in related domains, prompting further exploration and advancement in this critical area. This article is categorized under: Fundamental Concepts of Data and Knowledge > Human Centricity and User Interaction Application Areas > Internet Commercial, Legal, and Ethical Issues > Ethical Considerations Commercial, Legal, and Ethical Issues > Security and Privacy","PeriodicalId":500599,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136228956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Abstract Understanding and comprehending humans' views, beliefs, attitudes, or opinions toward a particular entity is sentiment analysis (SA). Advancements in e‐commerce platforms has led to an abundance of the real‐time and free forms of opinions floating on social media platforms. This real‐world data are imprecise and vague hence fuzzy logic is required to deal with such subjective data. Since opinions can be fuzzy in nature and definitions of opinion words can be elucidated differently; fuzzy logic has witnessed itself as an effective method to capture the expression of opinions. The study presents an elaborate review of the around 170 published research works for SA using fuzzy logic. The primary emphasis is focused on text‐based SA, audio‐based SA, and fusion of text‐audio features‐based SA. This article discusses the various novel ways of classifying fuzzy logic‐based SA research articles, which have not been accomplished by any other review article till date. The article puts forward the importance of SA tasks and identifies how fuzzy logic adds to this importance. Finally, the article outlines a taxonomy for sentiment classification based on the technique‐supervised and unsupervised in the SA models and comprehensively reviews the SA approaches specific to their task. Prominently, this study highlights the suitability of fuzzy‐based SA approaches into five different classes vis‐a‐vis (a) Sentiment Cognition from Words using fuzzy logic, (b) Sentiment Cognition from Phrases using fuzzy logic, (c) Fuzzy‐rule based SA, (d) Neuro‐fuzzy network‐based SA, and (e) Fuzzy Emotion Recognition. This article is categorized under: Algorithmic Development > Text Mining Fundamental Concepts of Data and Knowledge > Motivation and Emergence of Data Mining
{"title":"Sentiment analysis using fuzzy logic: A comprehensive literature review","authors":"Srishti Vashishtha, Vedika Gupta, Mamta Mittal","doi":"10.1002/widm.1509","DOIUrl":"https://doi.org/10.1002/widm.1509","url":null,"abstract":"Abstract Understanding and comprehending humans' views, beliefs, attitudes, or opinions toward a particular entity is sentiment analysis (SA). Advancements in e‐commerce platforms has led to an abundance of the real‐time and free forms of opinions floating on social media platforms. This real‐world data are imprecise and vague hence fuzzy logic is required to deal with such subjective data. Since opinions can be fuzzy in nature and definitions of opinion words can be elucidated differently; fuzzy logic has witnessed itself as an effective method to capture the expression of opinions. The study presents an elaborate review of the around 170 published research works for SA using fuzzy logic. The primary emphasis is focused on text‐based SA, audio‐based SA, and fusion of text‐audio features‐based SA. This article discusses the various novel ways of classifying fuzzy logic‐based SA research articles, which have not been accomplished by any other review article till date. The article puts forward the importance of SA tasks and identifies how fuzzy logic adds to this importance. Finally, the article outlines a taxonomy for sentiment classification based on the technique‐supervised and unsupervised in the SA models and comprehensively reviews the SA approaches specific to their task. Prominently, this study highlights the suitability of fuzzy‐based SA approaches into five different classes vis‐a‐vis (a) Sentiment Cognition from Words using fuzzy logic, (b) Sentiment Cognition from Phrases using fuzzy logic, (c) Fuzzy‐rule based SA, (d) Neuro‐fuzzy network‐based SA, and (e) Fuzzy Emotion Recognition. This article is categorized under: Algorithmic Development > Text Mining Fundamental Concepts of Data and Knowledge > Motivation and Emergence of Data Mining","PeriodicalId":500599,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135138711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Savaş Takan, Duygu Ergün, Sinem Getir Yaman, Onur Kılınççeker
Abstract The fairness of human‐related software has become critical with its widespread use in our daily lives, where life‐changing decisions are made. However, with the use of these systems, many erroneous results emerged. Technologies have started to be developed to tackle unexpected results. As for the solution to the issue, companies generally focus on algorithm‐oriented errors. The utilized solutions usually only work in some algorithms. Because the cause of the problem is not just the algorithm; it is also the data itself. For instance, deep learning cannot establish the cause–effect relationship quickly. In addition, the boundaries between statistical or heuristic algorithms are unclear. The algorithm's fairness may vary depending on the data related to context. From this point of view, our article focuses on how the data should be, which is not a matter of statistics. In this direction, the picture in question has been revealed through a scenario specific to “vulnerable and disadvantaged” groups, which is one of the most fundamental problems today. With the joint contribution of computer science and social sciences, it aims to predict the possible social dangers that may arise from artificial intelligence algorithms using the clues obtained in this study. To highlight the potential social and mass problems caused by data, Gerbner's “cultivation theory” is reinterpreted. To this end, we conduct an experimental evaluation on popular algorithms and their data sets, such as Word2Vec, GloVe, and ELMO. The article stresses the importance of a holistic approach combining the algorithm, data, and an interdisciplinary assessment. This article is categorized under: Algorithmic Development > Statistics
{"title":"Bias in human data: A feedback from social sciences","authors":"Savaş Takan, Duygu Ergün, Sinem Getir Yaman, Onur Kılınççeker","doi":"10.1002/widm.1498","DOIUrl":"https://doi.org/10.1002/widm.1498","url":null,"abstract":"Abstract The fairness of human‐related software has become critical with its widespread use in our daily lives, where life‐changing decisions are made. However, with the use of these systems, many erroneous results emerged. Technologies have started to be developed to tackle unexpected results. As for the solution to the issue, companies generally focus on algorithm‐oriented errors. The utilized solutions usually only work in some algorithms. Because the cause of the problem is not just the algorithm; it is also the data itself. For instance, deep learning cannot establish the cause–effect relationship quickly. In addition, the boundaries between statistical or heuristic algorithms are unclear. The algorithm's fairness may vary depending on the data related to context. From this point of view, our article focuses on how the data should be, which is not a matter of statistics. In this direction, the picture in question has been revealed through a scenario specific to “vulnerable and disadvantaged” groups, which is one of the most fundamental problems today. With the joint contribution of computer science and social sciences, it aims to predict the possible social dangers that may arise from artificial intelligence algorithms using the clues obtained in this study. To highlight the potential social and mass problems caused by data, Gerbner's “cultivation theory” is reinterpreted. To this end, we conduct an experimental evaluation on popular algorithms and their data sets, such as Word2Vec, GloVe, and ELMO. The article stresses the importance of a holistic approach combining the algorithm, data, and an interdisciplinary assessment. This article is categorized under: Algorithmic Development > Statistics","PeriodicalId":500599,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135663510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}