Michał Tomaszewski, Zineb Bougriche, Jakub Osuchowski, Rafał Gasz
The present systematic review gives a broad overview of the image-based datasets on which emotion has been classified. These datasets are categorized into three groups: (1) those containing only facial images, (2) those incorporating facial images with additional contextual body features, and (3) those based on visual components with physiological or audio signals. This paper discusses datasets in each category, focusing on their structure, emotion categories, emotion sources, anonymization level, and suitability for machine learning applications. A key aspect of this study is mapping emotions in the datasets analyzed to Plutchik's Wheel of Emotions, allowing standardization based on a consistent psychological model. This alignment demonstrates the need for more thorough and balanced datasets encompassing a wider range of affective states and exposes discrepancies in the representation of emotions. Comprehensive statistics on each dataset are also provided in our study, allowing for comparisons on sample size, distribution of emotions, anonymization, and sources of emotion. To conclude, this study discusses future directions, for which multimodal datasets that incorporate images, audio, and biosignals are of high importance as are databases containing a larger variety of emotions mapped more evenly across Plutchik's model.
{"title":"Systematic Review of Image-Based Emotion Recognition Datasets: A Comprehensive Analysis","authors":"Michał Tomaszewski, Zineb Bougriche, Jakub Osuchowski, Rafał Gasz","doi":"10.1002/widm.70047","DOIUrl":"https://doi.org/10.1002/widm.70047","url":null,"abstract":"The present systematic review gives a broad overview of the image-based datasets on which emotion has been classified. These datasets are categorized into three groups: (1) those containing only facial images, (2) those incorporating facial images with additional contextual body features, and (3) those based on visual components with physiological or audio signals. This paper discusses datasets in each category, focusing on their structure, emotion categories, emotion sources, anonymization level, and suitability for machine learning applications. A key aspect of this study is mapping emotions in the datasets analyzed to Plutchik's Wheel of Emotions, allowing standardization based on a consistent psychological model. This alignment demonstrates the need for more thorough and balanced datasets encompassing a wider range of affective states and exposes discrepancies in the representation of emotions. Comprehensive statistics on each dataset are also provided in our study, allowing for comparisons on sample size, distribution of emotions, anonymization, and sources of emotion. To conclude, this study discusses future directions, for which multimodal datasets that incorporate images, audio, and biosignals are of high importance as are databases containing a larger variety of emotions mapped more evenly across Plutchik's model.","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"22 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145332055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Akash Ghosh, Raghav Jain, Anubhav Jhangra, Sriparna Saha, Adam Jatowt
The widespread adoption of the Internet has transformed healthcare by enabling the digital storage, sharing, and management of medical documents. This shift has improved information access, enhanced patient care, and opened new avenues for research and innovation. As the volume of medical data available to clinicians and patients continues to grow, the need for effective summarization methods becomes increasingly critical. Recent breakthroughs in deep learning—particularly the emergence of Large Language Models (LLMs)—have further accelerated progress in this area. This paper provides a comprehensive survey of current techniques and emerging trends in medical document summarization.
{"title":"A Survey on Medical Document Summarization: From Machine Learning Techniques to Large Language Models","authors":"Akash Ghosh, Raghav Jain, Anubhav Jhangra, Sriparna Saha, Adam Jatowt","doi":"10.1002/widm.70045","DOIUrl":"https://doi.org/10.1002/widm.70045","url":null,"abstract":"The widespread adoption of the Internet has transformed healthcare by enabling the digital storage, sharing, and management of medical documents. This shift has improved information access, enhanced patient care, and opened new avenues for research and innovation. As the volume of medical data available to clinicians and patients continues to grow, the need for effective summarization methods becomes increasingly critical. Recent breakthroughs in deep learning—particularly the emergence of Large Language Models (LLMs)—have further accelerated progress in this area. This paper provides a comprehensive survey of current techniques and emerging trends in medical document summarization.","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"12 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145247627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Edward Hengzhou Yan, Feng Guo, Baolong Zhang, Muhammad Rehan, Delei Wang, Zhicheng Xu, Chi Ho Wong, Long Teng, Wai Sze Yip, Suet To
Precision machining, manufacturing components with superior surface quality and dimensional accuracy, increasingly leverages Internet of Things (IoT) technologies. This study employs a novel comparative text mining approach by systematically integrating tree maps, word clouds, keyword network analysis, and Pearson correlation to identify critical linkages between IoT and precision machining. By analyzing a scientific research database (2019–2023), this study highlights IoT's core competencies in enhancing precision machining, including real‐time monitoring, predictive maintenance, and data‐driven optimization. Furthermore, this study proposes actionable strategies, including neural network‐based cyber production systems, blockchain‐integrated IIoT platforms, and machine learning‐driven predictive models, for precision machining. These recommendations empower academia and industry to harness IoT to improve product quality and reduce costs in precision machining.This article is categorized under: Algorithmic Development > Text MiningFundamental Concepts of Data and Knowledge > Knowledge RepresentationTechnologies > Data Preprocessing
{"title":"Exploring the Application of the Internet of Things in Precision Machining by Comparative Text Mining","authors":"Edward Hengzhou Yan, Feng Guo, Baolong Zhang, Muhammad Rehan, Delei Wang, Zhicheng Xu, Chi Ho Wong, Long Teng, Wai Sze Yip, Suet To","doi":"10.1002/widm.70042","DOIUrl":"https://doi.org/10.1002/widm.70042","url":null,"abstract":"Precision machining, manufacturing components with superior surface quality and dimensional accuracy, increasingly leverages Internet of Things (IoT) technologies. This study employs a novel comparative text mining approach by systematically integrating tree maps, word clouds, keyword network analysis, and Pearson correlation to identify critical linkages between IoT and precision machining. By analyzing a scientific research database (2019–2023), this study highlights IoT's core competencies in enhancing precision machining, including real‐time monitoring, predictive maintenance, and data‐driven optimization. Furthermore, this study proposes actionable strategies, including neural network‐based cyber production systems, blockchain‐integrated IIoT platforms, and machine learning‐driven predictive models, for precision machining. These recommendations empower academia and industry to harness IoT to improve product quality and reduce costs in precision machining.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Algorithmic Development > Text Mining</jats:list-item> <jats:list-item>Fundamental Concepts of Data and Knowledge > Knowledge Representation</jats:list-item> <jats:list-item>Technologies > Data Preprocessing</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"34 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144995182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Machine learning models often face significant challenges while dealing with imbalanced and unlabeled datasets. Addressing these issues is resource‐intensive, requiring comprehensive strategies to navigate their individual complexities and compounded effects. This article explores the dual challenges imposed by class imbalance and the absence of labeled data, along with their individual complexities and combined effects on the performance of the model. This study addresses approaches for handling the imbalance problem in datasets, such as data‐level, algorithm‐level, and deep learning methods. The survey also examines hybrid methodologies that integrate these strategies to tackle the compounded issues effectively. Emerging techniques like Bayesian graph‐based learning, uncertainty‐guided semi‐supervised learning, and self‐supervised approaches are also considered for their potential to address the scalability, noise filtering, and generalization challenges associated with imbalanced and unlabeled datasets. It identified persistent gaps, such as the lack of robust evaluation metrics and the underutilization of dynamic feature extraction techniques, suggesting solutions with advanced machine learning approaches. Additionally, the need for adaptive techniques, such as dynamic class weighting and data‐driven filtering mechanisms, is highlighted to address limitations and improve the scalability of machine learning models in real‐world applications.This article is categorized under: Technologies > Machine LearningTechnologies > ClassificationTechnologies > Artificial Intelligence
{"title":"A Review of Unlabeled and Imbalanced Data Challenges in Machine Learning: Strategies and Solutions","authors":"Neethu M S, Vinod Chandra S S","doi":"10.1002/widm.70043","DOIUrl":"https://doi.org/10.1002/widm.70043","url":null,"abstract":"Machine learning models often face significant challenges while dealing with imbalanced and unlabeled datasets. Addressing these issues is resource‐intensive, requiring comprehensive strategies to navigate their individual complexities and compounded effects. This article explores the dual challenges imposed by class imbalance and the absence of labeled data, along with their individual complexities and combined effects on the performance of the model. This study addresses approaches for handling the imbalance problem in datasets, such as data‐level, algorithm‐level, and deep learning methods. The survey also examines hybrid methodologies that integrate these strategies to tackle the compounded issues effectively. Emerging techniques like Bayesian graph‐based learning, uncertainty‐guided semi‐supervised learning, and self‐supervised approaches are also considered for their potential to address the scalability, noise filtering, and generalization challenges associated with imbalanced and unlabeled datasets. It identified persistent gaps, such as the lack of robust evaluation metrics and the underutilization of dynamic feature extraction techniques, suggesting solutions with advanced machine learning approaches. Additionally, the need for adaptive techniques, such as dynamic class weighting and data‐driven filtering mechanisms, is highlighted to address limitations and improve the scalability of machine learning models in real‐world applications.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Technologies > Machine Learning</jats:list-item> <jats:list-item>Technologies > Classification</jats:list-item> <jats:list-item>Technologies > Artificial Intelligence</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"53 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144910693","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hamed Zamanian, Ahmad Shalbaf, Maryam Parvizi, Roohallah Alizadehsani, Ru‐San Tan, U. Rajendra Acharya
The global rise in fatty liver diseases is alarming. Traditional diagnostic methods include ultrasound, CT scans, MRI, and liver biopsies, the latter being the gold standard for diagnosis and treatment. Recent advancements in artificial intelligence (AI) have enhanced liver biopsy accuracy, improving treatment outcomes. This study investigates how various AI techniques aid histopathologists, gastroenterologists, and liver specialists in diagnosing and assessing liver damage due to abnormal fat accumulation. We conducted a systematic review of AI applications in evaluating fatty liver diseases, particularly through histopathological image analysis. Our search encompassed five scientific databases: PubMed Central, ACM Digital Library, IEEE Xplore, Scopus, and Google Scholar. We focused on peer‐reviewed articles, conference papers, theses, and book chapters, adhering to specific terminology. The data synthesis followed the PRISMA guidelines, comparing literature based on four key indices and their annual distribution. We evaluated 37 studies utilizing histopathological imaging for the diagnosis of non‐alcoholic fatty liver disease and non‐alcoholic steatohepatitis, including related conditions, metabolic dysfunction‐associated fatty liver disease and metabolic dysfunction‐associated steatohepatitis. The review summarized the performance of various algorithms and explored the distribution of machine learning efforts. Given the complexity of histopathological images, AI algorithms can effectively stratify liver samples affected by fat. Our findings indicate that AI's diagnostic performance closely matches traditional pathological interpretations, offering reliable results for clinical applications.This article is categorized under: Application Areas > Health CareTechnologies > Machine LearningTechnologies > Artificial Intelligence
全球脂肪肝发病率的上升令人担忧。传统的诊断方法包括超声、CT扫描、MRI和肝活检,后者是诊断和治疗的金标准。人工智能(AI)的最新进展提高了肝活检的准确性,改善了治疗效果。本研究探讨了各种人工智能技术如何帮助组织病理学家、胃肠病学家和肝脏专家诊断和评估由异常脂肪堆积引起的肝损伤。我们对人工智能在脂肪肝疾病评估中的应用进行了系统回顾,特别是通过组织病理学图像分析。我们的搜索包括5个科学数据库:PubMed Central、ACM Digital Library、IEEE explore、Scopus和b谷歌Scholar。我们专注于同行评审的文章,会议论文,论文和书籍章节,坚持特定的术语。数据综合遵循PRISMA指南,根据四个关键指标及其年度分布比较文献。我们评估了37项利用组织病理学成像诊断非酒精性脂肪性肝病和非酒精性脂肪性肝炎的研究,包括相关疾病、代谢功能障碍相关的脂肪性肝病和代谢功能障碍相关的脂肪性肝炎。这篇综述总结了各种算法的性能,并探讨了机器学习工作的分布。考虑到组织病理图像的复杂性,人工智能算法可以有效地对受脂肪影响的肝脏样本进行分层。我们的研究结果表明,人工智能的诊断性能与传统的病理解释非常接近,为临床应用提供了可靠的结果。本文分类如下:应用领域>;医疗保健技术>;机器学习技术>;人工智能
{"title":"Automated Detection of Non‐Alcoholic Fatty Liver Disease Using Histopathological Images: A Systematic Review","authors":"Hamed Zamanian, Ahmad Shalbaf, Maryam Parvizi, Roohallah Alizadehsani, Ru‐San Tan, U. Rajendra Acharya","doi":"10.1002/widm.70044","DOIUrl":"https://doi.org/10.1002/widm.70044","url":null,"abstract":"The global rise in fatty liver diseases is alarming. Traditional diagnostic methods include ultrasound, CT scans, MRI, and liver biopsies, the latter being the gold standard for diagnosis and treatment. Recent advancements in artificial intelligence (AI) have enhanced liver biopsy accuracy, improving treatment outcomes. This study investigates how various AI techniques aid histopathologists, gastroenterologists, and liver specialists in diagnosing and assessing liver damage due to abnormal fat accumulation. We conducted a systematic review of AI applications in evaluating fatty liver diseases, particularly through histopathological image analysis. Our search encompassed five scientific databases: PubMed Central, ACM Digital Library, IEEE Xplore, Scopus, and Google Scholar. We focused on peer‐reviewed articles, conference papers, theses, and book chapters, adhering to specific terminology. The data synthesis followed the PRISMA guidelines, comparing literature based on four key indices and their annual distribution. We evaluated 37 studies utilizing histopathological imaging for the diagnosis of non‐alcoholic fatty liver disease and non‐alcoholic steatohepatitis, including related conditions, metabolic dysfunction‐associated fatty liver disease and metabolic dysfunction‐associated steatohepatitis. The review summarized the performance of various algorithms and explored the distribution of machine learning efforts. Given the complexity of histopathological images, AI algorithms can effectively stratify liver samples affected by fat. Our findings indicate that AI's diagnostic performance closely matches traditional pathological interpretations, offering reliable results for clinical applications.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Application Areas > Health Care</jats:list-item> <jats:list-item>Technologies > Machine Learning</jats:list-item> <jats:list-item>Technologies > Artificial Intelligence</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"25 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144915643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The application of argument mining (AM) in the educational domain is a tool for identifying text structures that express an argument. AM can help evaluate the quality of students' assignments, generate insights into their perspectives, and understand their stance on certain topics. This article examines various aspects of AM in education, including techniques, models, approaches, data representation, language resources, and target artifacts. The findings suggest that AM can enhance learning and teaching processes. However, the study highlights gaps in the literature, particularly in exploring educational artifacts like debates and a lack of research on AM in languages other than English. This paper calls for further research to improve educational outcomes through AM in the educational domain.This article is categorized under: Application Areas > Education and LearningTechnologies > Artificial IntelligenceTechnologies > Machine Learning
{"title":"A Comprehensive Survey of Argument Mining in the Educational Domain: Techniques, Applications, and Future Directions","authors":"David Eduardo Pereira, Daniela Thuaslar Simão Gomes, Larissa Lucena Vasconcelos, Claudio Elizio Calazans Campelo","doi":"10.1002/widm.70041","DOIUrl":"https://doi.org/10.1002/widm.70041","url":null,"abstract":"The application of argument mining (AM) in the educational domain is a tool for identifying text structures that express an argument. AM can help evaluate the quality of students' assignments, generate insights into their perspectives, and understand their stance on certain topics. This article examines various aspects of AM in education, including techniques, models, approaches, data representation, language resources, and target artifacts. The findings suggest that AM can enhance learning and teaching processes. However, the study highlights gaps in the literature, particularly in exploring educational artifacts like debates and a lack of research on AM in languages other than English. This paper calls for further research to improve educational outcomes through AM in the educational domain.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Application Areas > Education and Learning</jats:list-item> <jats:list-item>Technologies > Artificial Intelligence</jats:list-item> <jats:list-item>Technologies > Machine Learning</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144900517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The rapid proliferation of the Internet of Things (IoT) has integrated billions of smart devices into our daily lives, generating and exchanging vast amounts of critical data. While this connectivity offers significant benefits, it also introduces numerous security vulnerabilities. Addressing these vulnerabilities requires a comprehensive approach to hardware security, one that evaluates the interplay of various attacks and countermeasures to protect these systems. This article provides an extensive overview of hardware security strategies and explores contemporary attacks threatening connected systems. We begin by presenting state‐of‐the‐art side‐channel and fault attacks targeting embedded systems, emphasizing the wide range of IoT targets such as smart home devices, medical implants, industrial control systems, and automotive components. Next, we examine hardware‐based security primitives such as physically unclonable functions (PUFs) and physically related functions (PReFs), which have emerged as promising solutions for establishing a hardware root‐of‐trust in lightweight, resource‐constrained devices. These primitives provide robust alternatives to secure storage of cryptographic keys, essential for protecting the diverse array of IoT devices. Further, we discuss trusted architectures, hardware Trojans, and physical assurance mechanisms, highlighting their roles in enhancing security across different IoT environments. We conclude by exploring the expanse of machine learning‐assisted attacks, which present new and intriguing challenges across all the aforementioned security domains. This article aims to offer valuable insights into the current challenges and future directions of research in hardware security, particularly pertaining to the varied and expanding landscape of IoT devices.This article is categorized under: Technologies > Internet of ThingsTechnologies > Machine LearningCommercial, Legal, and Ethical Issues > Security and Privacy
{"title":"Hardware Security in the Connected World","authors":"Durba Chatterjee, Shuvodip Maitra, Nimish Mishra, Shubhi Shukla, Debdeep Mukhopadhyay","doi":"10.1002/widm.70034","DOIUrl":"https://doi.org/10.1002/widm.70034","url":null,"abstract":"The rapid proliferation of the Internet of Things (IoT) has integrated billions of smart devices into our daily lives, generating and exchanging vast amounts of critical data. While this connectivity offers significant benefits, it also introduces numerous security vulnerabilities. Addressing these vulnerabilities requires a comprehensive approach to hardware security, one that evaluates the interplay of various attacks and countermeasures to protect these systems. This article provides an extensive overview of hardware security strategies and explores contemporary attacks threatening connected systems. We begin by presenting state‐of‐the‐art side‐channel and fault attacks targeting embedded systems, emphasizing the wide range of IoT targets such as smart home devices, medical implants, industrial control systems, and automotive components. Next, we examine hardware‐based security primitives such as physically unclonable functions (PUFs) and physically related functions (PReFs), which have emerged as promising solutions for establishing a hardware root‐of‐trust in lightweight, resource‐constrained devices. These primitives provide robust alternatives to secure storage of cryptographic keys, essential for protecting the diverse array of IoT devices. Further, we discuss trusted architectures, hardware Trojans, and physical assurance mechanisms, highlighting their roles in enhancing security across different IoT environments. We conclude by exploring the expanse of machine learning‐assisted attacks, which present new and intriguing challenges across all the aforementioned security domains. This article aims to offer valuable insights into the current challenges and future directions of research in hardware security, particularly pertaining to the varied and expanding landscape of IoT devices.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Technologies > Internet of Things</jats:list-item> <jats:list-item>Technologies > Machine Learning</jats:list-item> <jats:list-item>Commercial, Legal, and Ethical Issues > Security and Privacy</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"179 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144850846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Brain–computer interfaces (BCIs) have emerged as transformative tools, enabling direct communication between the brain and external devices, particularly for individuals with neuromuscular disabilities. This paper provides a comprehensive analysis of feature extraction (FE) methods across all major signal processing domains and various types of BCIs, addressing a significant gap in existing reviews and surveys that often focus exclusively on EEG‐based systems. Also, a detailed comparative analysis of FE techniques, highlighting their formulas, advantages, limitations, and practical applications, is provided. The study not only reviews state‐of‐the‐art methods but also evaluates recent research, identifying trends and gaps in the field. Key insights reveal a growing foundation for invasive BCI research, which, while currently limited, shows promise for future advancements. Based on this analysis, we identify and discuss open challenges such as inter‐subject variability, real‐time processing demands, integration of multiple modalities, and user training and adaptation. Additionally, we examine pressing concerns related to security, privacy, and the transferability of models. By addressing these challenges, this paper aims to guide the development of robust, efficient, and inclusive BCI systems, paving the way for cutting‐edge innovations and real‐world applications.This article is categorized under: Technologies > Machine LearningFundamental Concepts of Data and Knowledge > Human Centricity and User Interaction
{"title":"Exploring the Evolution of Feature Extraction Methods in Brain–Computer Interfaces (BCIs): A Systematic Review of Research Progress and Future Trends","authors":"Shweta Thakur, Samriti Thakur, Aryan Rana, Pankaj Kumar, Kranti Kumar, Chien‐Ming Chen","doi":"10.1002/widm.70040","DOIUrl":"https://doi.org/10.1002/widm.70040","url":null,"abstract":"Brain–computer interfaces (BCIs) have emerged as transformative tools, enabling direct communication between the brain and external devices, particularly for individuals with neuromuscular disabilities. This paper provides a comprehensive analysis of feature extraction (FE) methods across all major signal processing domains and various types of BCIs, addressing a significant gap in existing reviews and surveys that often focus exclusively on EEG‐based systems. Also, a detailed comparative analysis of FE techniques, highlighting their formulas, advantages, limitations, and practical applications, is provided. The study not only reviews state‐of‐the‐art methods but also evaluates recent research, identifying trends and gaps in the field. Key insights reveal a growing foundation for invasive BCI research, which, while currently limited, shows promise for future advancements. Based on this analysis, we identify and discuss open challenges such as inter‐subject variability, real‐time processing demands, integration of multiple modalities, and user training and adaptation. Additionally, we examine pressing concerns related to security, privacy, and the transferability of models. By addressing these challenges, this paper aims to guide the development of robust, efficient, and inclusive BCI systems, paving the way for cutting‐edge innovations and real‐world applications.This article is categorized under: <jats:list list-type=\"bullet\"> <jats:list-item>Technologies > Machine Learning</jats:list-item> <jats:list-item>Fundamental Concepts of Data and Knowledge > Human Centricity and User Interaction</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"746 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144850845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Remote photoplethysmography (rPPG) has emerged as a vital technology for remote healthcare, offering non‐invasive and accessible health monitoring through off‐the‐shelf standard video cameras. rPPG facilitates the assessment of key health indicators like heart rate (HR), respiratory rate (RR), and blood oxygen saturation (SpO2) from video data, providing advantages in early disease diagnosis and routine health assessments. Recognizing its potential, researchers from multiple fields have substantially progressed rPPG by establishing a strong theoretical basis for signal acquisition and developing signal processing and data‐driven algorithms for rPPG extraction. While most rPPG reviews primarily focus on HR signal extraction methods, our research provides an overview of the potential scope of rPPG. We systematically organize research on rPPG signal acquisition and extraction techniques and provide a critical review of recent rPPG advancements in diverse health parameter estimation. Besides providing a thorough HR estimation review, we incorporate the extraction of derivative signals such as RR and SpO2 from rPPG data, including their applications and limitations. We also highlight the adaptation of Machine Learning (ML), Deep Learning (DL), and Computer Vision (CV) techniques with rPPG technologies, and accumulate available critical rPPG resources like datasets, codes, and tutorials. Finally, we identify challenges and research gaps, such as motion artifacts, varying lighting conditions, and differences in skin tone. We aim to uplift advancements in rPPG systems by outlining future research directions. Our comprehensive review aims to support the development of robust and safe applications by advancing the field of contactless health parameter sensing.This article is categorized under: Application Areas > Health CareTechnologies > Machine LearningFundamental Concepts of Data and Knowledge > Human Centricity and User Interaction
{"title":"A State‐Of‐The‐Art Survey of Remote Photoplethysmography for Contactless Health Parameters Sensing","authors":"Shadman Sakib, Zahid Hasan, Nirmalya Roy","doi":"10.1002/widm.70039","DOIUrl":"https://doi.org/10.1002/widm.70039","url":null,"abstract":"Remote photoplethysmography (rPPG) has emerged as a vital technology for remote healthcare, offering non‐invasive and accessible health monitoring through off‐the‐shelf standard video cameras. rPPG facilitates the assessment of key health indicators like heart rate (HR), respiratory rate (RR), and blood oxygen saturation (SpO<jats:sub>2</jats:sub>) from video data, providing advantages in early disease diagnosis and routine health assessments. Recognizing its potential, researchers from multiple fields have substantially progressed rPPG by establishing a strong theoretical basis for signal acquisition and developing signal processing and data‐driven algorithms for rPPG extraction. While most rPPG reviews primarily focus on HR signal extraction methods, our research provides an overview of the potential scope of rPPG. We systematically organize research on rPPG signal acquisition and extraction techniques and provide a critical review of recent rPPG advancements in diverse health parameter estimation. Besides providing a thorough HR estimation review, we incorporate the extraction of derivative signals such as RR and SpO<jats:sub>2</jats:sub> from rPPG data, including their applications and limitations. We also highlight the adaptation of Machine Learning (ML), Deep Learning (DL), and Computer Vision (CV) techniques with rPPG technologies, and accumulate available critical rPPG resources like datasets, codes, and tutorials. Finally, we identify challenges and research gaps, such as motion artifacts, varying lighting conditions, and differences in skin tone. We aim to uplift advancements in rPPG systems by outlining future research directions. Our comprehensive review aims to support the development of robust and safe applications by advancing the field of contactless health parameter sensing.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Application Areas > Health Care</jats:list-item> <jats:list-item>Technologies > Machine Learning</jats:list-item> <jats:list-item>Fundamental Concepts of Data and Knowledge > Human Centricity and User Interaction</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144792362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Chronic diseases (CDs) present a global health challenge due to their complex, overlapping symptoms and the limitations of traditional diagnostic methods. Artificial intelligence (AI)‐based techniques, particularly Machine Learning (ML) and Meta‐Heuristic Optimization (MHO) algorithms, have emerged as powerful tools for addressing these challenges. This review examines ML and MHO‐based approaches for the multi‐classification of CDs, highlighting how MHO enhances ML frameworks by addressing key limitations such as class imbalance and suboptimal feature selection. Despite these advancements, MHO‐based methods face challenges, including computational complexity and algorithmic biases, which require further research. By critically analyzing existing studies and identifying gaps, this paper provides a foundation for developing more robust and efficient diagnostic models for CDs.This article is categorized under: Application Areas > Health CareTechnologies > Machine LearningTechnologies > Prediction
{"title":"Meta‐Heuristic Optimization for the Multi‐Classification of Chronic Disease: A Review With Machine Learning Perspectives","authors":"Akansha Singh, Nupur Prakash, Anurag Jain","doi":"10.1002/widm.70030","DOIUrl":"https://doi.org/10.1002/widm.70030","url":null,"abstract":"Chronic diseases (CDs) present a global health challenge due to their complex, overlapping symptoms and the limitations of traditional diagnostic methods. Artificial intelligence (AI)‐based techniques, particularly Machine Learning (ML) and Meta‐Heuristic Optimization (MHO) algorithms, have emerged as powerful tools for addressing these challenges. This review examines ML and MHO‐based approaches for the multi‐classification of CDs, highlighting how MHO enhances ML frameworks by addressing key limitations such as class imbalance and suboptimal feature selection. Despite these advancements, MHO‐based methods face challenges, including computational complexity and algorithmic biases, which require further research. By critically analyzing existing studies and identifying gaps, this paper provides a foundation for developing more robust and efficient diagnostic models for CDs.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Application Areas > Health Care</jats:list-item> <jats:list-item>Technologies > Machine Learning</jats:list-item> <jats:list-item>Technologies > Prediction</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"148 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144747364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}