首页 > 最新文献

WIREs Data Mining and Knowledge Discovery最新文献

英文 中文
A Comprehensive Survey of Argument Mining in the Educational Domain: Techniques, Applications, and Future Directions 教育领域论证挖掘的综合研究:技术、应用和未来方向
Pub Date : 2025-08-26 DOI: 10.1002/widm.70041
David Eduardo Pereira, Daniela Thuaslar Simão Gomes, Larissa Lucena Vasconcelos, Claudio Elizio Calazans Campelo
The application of argument mining (AM) in the educational domain is a tool for identifying text structures that express an argument. AM can help evaluate the quality of students' assignments, generate insights into their perspectives, and understand their stance on certain topics. This article examines various aspects of AM in education, including techniques, models, approaches, data representation, language resources, and target artifacts. The findings suggest that AM can enhance learning and teaching processes. However, the study highlights gaps in the literature, particularly in exploring educational artifacts like debates and a lack of research on AM in languages other than English. This paper calls for further research to improve educational outcomes through AM in the educational domain.This article is categorized under: Application Areas > Education and Learning Technologies > Artificial Intelligence Technologies > Machine Learning
论点挖掘(AM)在教育领域的应用是一种识别表达论点的文本结构的工具。AM可以帮助评估学生作业的质量,对他们的观点产生见解,并了解他们对某些主题的立场。本文研究了AM在教育中的各个方面,包括技术、模型、方法、数据表示、语言资源和目标工件。研究结果表明,AM可以提高学习和教学过程。然而,该研究强调了文献上的差距,特别是在探索辩论等教育文物方面,以及对英语以外语言的AM研究的缺乏。本文呼吁进一步研究如何通过AM在教育领域改善教育成果。本文分类如下:应用领域;教育与学习技术;人工智能技术;机器学习
{"title":"A Comprehensive Survey of Argument Mining in the Educational Domain: Techniques, Applications, and Future Directions","authors":"David Eduardo Pereira, Daniela Thuaslar Simão Gomes, Larissa Lucena Vasconcelos, Claudio Elizio Calazans Campelo","doi":"10.1002/widm.70041","DOIUrl":"https://doi.org/10.1002/widm.70041","url":null,"abstract":"The application of argument mining (AM) in the educational domain is a tool for identifying text structures that express an argument. AM can help evaluate the quality of students' assignments, generate insights into their perspectives, and understand their stance on certain topics. This article examines various aspects of AM in education, including techniques, models, approaches, data representation, language resources, and target artifacts. The findings suggest that AM can enhance learning and teaching processes. However, the study highlights gaps in the literature, particularly in exploring educational artifacts like debates and a lack of research on AM in languages other than English. This paper calls for further research to improve educational outcomes through AM in the educational domain.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Application Areas &gt; Education and Learning</jats:list-item> <jats:list-item>Technologies &gt; Artificial Intelligence</jats:list-item> <jats:list-item>Technologies &gt; Machine Learning</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144900517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hardware Security in the Connected World 互联世界中的硬件安全
Pub Date : 2025-08-13 DOI: 10.1002/widm.70034
Durba Chatterjee, Shuvodip Maitra, Nimish Mishra, Shubhi Shukla, Debdeep Mukhopadhyay
The rapid proliferation of the Internet of Things (IoT) has integrated billions of smart devices into our daily lives, generating and exchanging vast amounts of critical data. While this connectivity offers significant benefits, it also introduces numerous security vulnerabilities. Addressing these vulnerabilities requires a comprehensive approach to hardware security, one that evaluates the interplay of various attacks and countermeasures to protect these systems. This article provides an extensive overview of hardware security strategies and explores contemporary attacks threatening connected systems. We begin by presenting state‐of‐the‐art side‐channel and fault attacks targeting embedded systems, emphasizing the wide range of IoT targets such as smart home devices, medical implants, industrial control systems, and automotive components. Next, we examine hardware‐based security primitives such as physically unclonable functions (PUFs) and physically related functions (PReFs), which have emerged as promising solutions for establishing a hardware root‐of‐trust in lightweight, resource‐constrained devices. These primitives provide robust alternatives to secure storage of cryptographic keys, essential for protecting the diverse array of IoT devices. Further, we discuss trusted architectures, hardware Trojans, and physical assurance mechanisms, highlighting their roles in enhancing security across different IoT environments. We conclude by exploring the expanse of machine learning‐assisted attacks, which present new and intriguing challenges across all the aforementioned security domains. This article aims to offer valuable insights into the current challenges and future directions of research in hardware security, particularly pertaining to the varied and expanding landscape of IoT devices.This article is categorized under: Technologies > Internet of Things Technologies > Machine Learning Commercial, Legal, and Ethical Issues > Security and Privacy
物联网(IoT)的快速发展使数十亿智能设备融入我们的日常生活,产生和交换大量关键数据。虽然这种连接提供了显著的好处,但它也引入了许多安全漏洞。解决这些漏洞需要一种全面的硬件安全方法,一种评估各种攻击的相互作用和对策以保护这些系统的方法。本文提供了硬件安全策略的广泛概述,并探讨了威胁连接系统的当代攻击。我们首先介绍了针对嵌入式系统的最先进的侧信道和故障攻击,强调了广泛的物联网目标,如智能家居设备、医疗植入物、工业控制系统和汽车部件。接下来,我们将研究基于硬件的安全原语,如物理不可克隆功能(puf)和物理相关功能(pref),它们已成为在轻量级资源受限设备中建立硬件信任根的有希望的解决方案。这些原语为加密密钥的安全存储提供了强大的替代方案,对于保护各种物联网设备至关重要。此外,我们还讨论了可信架构、硬件木马和物理保证机制,强调了它们在增强不同物联网环境安全性方面的作用。最后,我们探讨了机器学习辅助攻击的范围,这些攻击在上述所有安全领域都提出了新的和有趣的挑战。本文旨在为硬件安全研究的当前挑战和未来方向提供有价值的见解,特别是与物联网设备的变化和扩展有关。本文分类如下:技术>;物联网技术;机器学习商业、法律和伦理问题安全及私隐
{"title":"Hardware Security in the Connected World","authors":"Durba Chatterjee, Shuvodip Maitra, Nimish Mishra, Shubhi Shukla, Debdeep Mukhopadhyay","doi":"10.1002/widm.70034","DOIUrl":"https://doi.org/10.1002/widm.70034","url":null,"abstract":"The rapid proliferation of the Internet of Things (IoT) has integrated billions of smart devices into our daily lives, generating and exchanging vast amounts of critical data. While this connectivity offers significant benefits, it also introduces numerous security vulnerabilities. Addressing these vulnerabilities requires a comprehensive approach to hardware security, one that evaluates the interplay of various attacks and countermeasures to protect these systems. This article provides an extensive overview of hardware security strategies and explores contemporary attacks threatening connected systems. We begin by presenting state‐of‐the‐art side‐channel and fault attacks targeting embedded systems, emphasizing the wide range of IoT targets such as smart home devices, medical implants, industrial control systems, and automotive components. Next, we examine hardware‐based security primitives such as physically unclonable functions (PUFs) and physically related functions (PReFs), which have emerged as promising solutions for establishing a hardware root‐of‐trust in lightweight, resource‐constrained devices. These primitives provide robust alternatives to secure storage of cryptographic keys, essential for protecting the diverse array of IoT devices. Further, we discuss trusted architectures, hardware Trojans, and physical assurance mechanisms, highlighting their roles in enhancing security across different IoT environments. We conclude by exploring the expanse of machine learning‐assisted attacks, which present new and intriguing challenges across all the aforementioned security domains. This article aims to offer valuable insights into the current challenges and future directions of research in hardware security, particularly pertaining to the varied and expanding landscape of IoT devices.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Technologies &gt; Internet of Things</jats:list-item> <jats:list-item>Technologies &gt; Machine Learning</jats:list-item> <jats:list-item>Commercial, Legal, and Ethical Issues &gt; Security and Privacy</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"179 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144850846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring the Evolution of Feature Extraction Methods in Brain–Computer Interfaces (BCIs): A Systematic Review of Research Progress and Future Trends 探讨脑机接口特征提取方法的演变:研究进展和未来趋势的系统综述
Pub Date : 2025-08-12 DOI: 10.1002/widm.70040
Shweta Thakur, Samriti Thakur, Aryan Rana, Pankaj Kumar, Kranti Kumar, Chien‐Ming Chen
Brain–computer interfaces (BCIs) have emerged as transformative tools, enabling direct communication between the brain and external devices, particularly for individuals with neuromuscular disabilities. This paper provides a comprehensive analysis of feature extraction (FE) methods across all major signal processing domains and various types of BCIs, addressing a significant gap in existing reviews and surveys that often focus exclusively on EEG‐based systems. Also, a detailed comparative analysis of FE techniques, highlighting their formulas, advantages, limitations, and practical applications, is provided. The study not only reviews state‐of‐the‐art methods but also evaluates recent research, identifying trends and gaps in the field. Key insights reveal a growing foundation for invasive BCI research, which, while currently limited, shows promise for future advancements. Based on this analysis, we identify and discuss open challenges such as inter‐subject variability, real‐time processing demands, integration of multiple modalities, and user training and adaptation. Additionally, we examine pressing concerns related to security, privacy, and the transferability of models. By addressing these challenges, this paper aims to guide the development of robust, efficient, and inclusive BCI systems, paving the way for cutting‐edge innovations and real‐world applications.This article is categorized under: Technologies > Machine Learning Fundamental Concepts of Data and Knowledge > Human Centricity and User Interaction
脑机接口(bci)已经成为一种变革性的工具,可以实现大脑和外部设备之间的直接通信,特别是对于神经肌肉残疾的个体。本文对所有主要信号处理领域和各种类型的脑机接口的特征提取(FE)方法进行了全面分析,解决了现有评论和调查中的重大差距,这些评论和调查通常只关注基于EEG的系统。此外,还对有限元技术进行了详细的比较分析,重点介绍了它们的公式、优点、局限性和实际应用。该研究不仅回顾了最先进的方法,还评估了最近的研究,确定了该领域的趋势和差距。关键的见解揭示了侵入性脑机接口研究的日益增长的基础,虽然目前有限,但显示出未来进步的希望。基于这一分析,我们确定并讨论了开放的挑战,如主体间的可变性、实时处理需求、多种模式的集成以及用户培训和适应。此外,我们还研究了与模型的安全性、隐私性和可移植性相关的紧迫问题。通过解决这些挑战,本文旨在指导稳健、高效、包容的BCI系统的发展,为前沿创新和现实世界的应用铺平道路。本文分类如下:技术>;机器学习:数据和知识的基本概念以人为中心和用户交互
{"title":"Exploring the Evolution of Feature Extraction Methods in Brain–Computer Interfaces (BCIs): A Systematic Review of Research Progress and Future Trends","authors":"Shweta Thakur, Samriti Thakur, Aryan Rana, Pankaj Kumar, Kranti Kumar, Chien‐Ming Chen","doi":"10.1002/widm.70040","DOIUrl":"https://doi.org/10.1002/widm.70040","url":null,"abstract":"Brain–computer interfaces (BCIs) have emerged as transformative tools, enabling direct communication between the brain and external devices, particularly for individuals with neuromuscular disabilities. This paper provides a comprehensive analysis of feature extraction (FE) methods across all major signal processing domains and various types of BCIs, addressing a significant gap in existing reviews and surveys that often focus exclusively on EEG‐based systems. Also, a detailed comparative analysis of FE techniques, highlighting their formulas, advantages, limitations, and practical applications, is provided. The study not only reviews state‐of‐the‐art methods but also evaluates recent research, identifying trends and gaps in the field. Key insights reveal a growing foundation for invasive BCI research, which, while currently limited, shows promise for future advancements. Based on this analysis, we identify and discuss open challenges such as inter‐subject variability, real‐time processing demands, integration of multiple modalities, and user training and adaptation. Additionally, we examine pressing concerns related to security, privacy, and the transferability of models. By addressing these challenges, this paper aims to guide the development of robust, efficient, and inclusive BCI systems, paving the way for cutting‐edge innovations and real‐world applications.This article is categorized under: <jats:list list-type=\"bullet\"> <jats:list-item>Technologies &gt; Machine Learning</jats:list-item> <jats:list-item>Fundamental Concepts of Data and Knowledge &gt; Human Centricity and User Interaction</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"746 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144850845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A State‐Of‐The‐Art Survey of Remote Photoplethysmography for Contactless Health Parameters Sensing 用于非接触式健康参数传感的远程光容积脉搏图的最新研究
Pub Date : 2025-08-06 DOI: 10.1002/widm.70039
Shadman Sakib, Zahid Hasan, Nirmalya Roy
Remote photoplethysmography (rPPG) has emerged as a vital technology for remote healthcare, offering non‐invasive and accessible health monitoring through off‐the‐shelf standard video cameras. rPPG facilitates the assessment of key health indicators like heart rate (HR), respiratory rate (RR), and blood oxygen saturation (SpO2) from video data, providing advantages in early disease diagnosis and routine health assessments. Recognizing its potential, researchers from multiple fields have substantially progressed rPPG by establishing a strong theoretical basis for signal acquisition and developing signal processing and data‐driven algorithms for rPPG extraction. While most rPPG reviews primarily focus on HR signal extraction methods, our research provides an overview of the potential scope of rPPG. We systematically organize research on rPPG signal acquisition and extraction techniques and provide a critical review of recent rPPG advancements in diverse health parameter estimation. Besides providing a thorough HR estimation review, we incorporate the extraction of derivative signals such as RR and SpO2 from rPPG data, including their applications and limitations. We also highlight the adaptation of Machine Learning (ML), Deep Learning (DL), and Computer Vision (CV) techniques with rPPG technologies, and accumulate available critical rPPG resources like datasets, codes, and tutorials. Finally, we identify challenges and research gaps, such as motion artifacts, varying lighting conditions, and differences in skin tone. We aim to uplift advancements in rPPG systems by outlining future research directions. Our comprehensive review aims to support the development of robust and safe applications by advancing the field of contactless health parameter sensing.This article is categorized under: Application Areas > Health Care Technologies > Machine Learning Fundamental Concepts of Data and Knowledge > Human Centricity and User Interaction
远程光电容积脉搏波描记(rPPG)已经成为远程医疗的一项重要技术,通过现成的标准摄像机提供非侵入性和可访问的健康监测。rPPG有助于从视频数据中评估心率(HR)、呼吸频率(RR)、血氧饱和度(SpO2)等关键健康指标,为疾病早期诊断和常规健康评估提供优势。认识到rPPG的潜力,来自多个领域的研究人员通过建立强大的信号采集理论基础,开发用于rPPG提取的信号处理和数据驱动算法,大大推进了rPPG的发展。虽然大多数rPPG综述主要集中在HR信号提取方法上,但我们的研究概述了rPPG的潜在范围。我们系统地组织了rPPG信号采集和提取技术的研究,并对rPPG在各种健康参数估计方面的最新进展进行了综述。除了提供全面的HR估计综述外,我们还结合了从rPPG数据中提取衍生信号(如RR和SpO2),包括它们的应用和局限性。我们还强调了机器学习(ML),深度学习(DL)和计算机视觉(CV)技术与rPPG技术的适应,并积累了可用的关键rPPG资源,如数据集,代码和教程。最后,我们确定了挑战和研究差距,如运动伪影,不同的照明条件和肤色的差异。我们的目标是通过概述未来的研究方向来提升rPPG系统的进步。我们的综合综述旨在通过推进非接触式健康参数传感领域来支持稳健和安全应用的发展。本文分类如下:应用领域>;医疗保健技术;机器学习:数据和知识的基本概念以人为中心和用户交互
{"title":"A State‐Of‐The‐Art Survey of Remote Photoplethysmography for Contactless Health Parameters Sensing","authors":"Shadman Sakib, Zahid Hasan, Nirmalya Roy","doi":"10.1002/widm.70039","DOIUrl":"https://doi.org/10.1002/widm.70039","url":null,"abstract":"Remote photoplethysmography (rPPG) has emerged as a vital technology for remote healthcare, offering non‐invasive and accessible health monitoring through off‐the‐shelf standard video cameras. rPPG facilitates the assessment of key health indicators like heart rate (HR), respiratory rate (RR), and blood oxygen saturation (SpO<jats:sub>2</jats:sub>) from video data, providing advantages in early disease diagnosis and routine health assessments. Recognizing its potential, researchers from multiple fields have substantially progressed rPPG by establishing a strong theoretical basis for signal acquisition and developing signal processing and data‐driven algorithms for rPPG extraction. While most rPPG reviews primarily focus on HR signal extraction methods, our research provides an overview of the potential scope of rPPG. We systematically organize research on rPPG signal acquisition and extraction techniques and provide a critical review of recent rPPG advancements in diverse health parameter estimation. Besides providing a thorough HR estimation review, we incorporate the extraction of derivative signals such as RR and SpO<jats:sub>2</jats:sub> from rPPG data, including their applications and limitations. We also highlight the adaptation of Machine Learning (ML), Deep Learning (DL), and Computer Vision (CV) techniques with rPPG technologies, and accumulate available critical rPPG resources like datasets, codes, and tutorials. Finally, we identify challenges and research gaps, such as motion artifacts, varying lighting conditions, and differences in skin tone. We aim to uplift advancements in rPPG systems by outlining future research directions. Our comprehensive review aims to support the development of robust and safe applications by advancing the field of contactless health parameter sensing.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Application Areas &gt; Health Care</jats:list-item> <jats:list-item>Technologies &gt; Machine Learning</jats:list-item> <jats:list-item>Fundamental Concepts of Data and Knowledge &gt; Human Centricity and User Interaction</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144792362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Meta‐Heuristic Optimization for the Multi‐Classification of Chronic Disease: A Review With Machine Learning Perspectives 慢性病多重分类的元启发式优化:基于机器学习视角的综述
Pub Date : 2025-07-29 DOI: 10.1002/widm.70030
Akansha Singh, Nupur Prakash, Anurag Jain
Chronic diseases (CDs) present a global health challenge due to their complex, overlapping symptoms and the limitations of traditional diagnostic methods. Artificial intelligence (AI)‐based techniques, particularly Machine Learning (ML) and Meta‐Heuristic Optimization (MHO) algorithms, have emerged as powerful tools for addressing these challenges. This review examines ML and MHO‐based approaches for the multi‐classification of CDs, highlighting how MHO enhances ML frameworks by addressing key limitations such as class imbalance and suboptimal feature selection. Despite these advancements, MHO‐based methods face challenges, including computational complexity and algorithmic biases, which require further research. By critically analyzing existing studies and identifying gaps, this paper provides a foundation for developing more robust and efficient diagnostic models for CDs.This article is categorized under: Application Areas > Health Care Technologies > Machine Learning Technologies > Prediction
慢性疾病由于其复杂、重叠的症状和传统诊断方法的局限性,对全球健康构成了挑战。基于人工智能(AI)的技术,特别是机器学习(ML)和元启发式优化(MHO)算法,已经成为解决这些挑战的强大工具。本文研究了基于ML和基于MHO的cd多分类方法,强调了MHO如何通过解决类不平衡和次优特征选择等关键限制来增强ML框架。尽管取得了这些进步,但基于MHO的方法仍面临挑战,包括计算复杂性和算法偏差,这需要进一步研究。通过批判性地分析现有研究并找出差距,本文为开发更健壮和有效的cd诊断模型提供了基础。本文分类如下:应用领域>;医疗保健技术;机器学习技术;预测
{"title":"Meta‐Heuristic Optimization for the Multi‐Classification of Chronic Disease: A Review With Machine Learning Perspectives","authors":"Akansha Singh, Nupur Prakash, Anurag Jain","doi":"10.1002/widm.70030","DOIUrl":"https://doi.org/10.1002/widm.70030","url":null,"abstract":"Chronic diseases (CDs) present a global health challenge due to their complex, overlapping symptoms and the limitations of traditional diagnostic methods. Artificial intelligence (AI)‐based techniques, particularly Machine Learning (ML) and Meta‐Heuristic Optimization (MHO) algorithms, have emerged as powerful tools for addressing these challenges. This review examines ML and MHO‐based approaches for the multi‐classification of CDs, highlighting how MHO enhances ML frameworks by addressing key limitations such as class imbalance and suboptimal feature selection. Despite these advancements, MHO‐based methods face challenges, including computational complexity and algorithmic biases, which require further research. By critically analyzing existing studies and identifying gaps, this paper provides a foundation for developing more robust and efficient diagnostic models for CDs.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Application Areas &gt; Health Care</jats:list-item> <jats:list-item>Technologies &gt; Machine Learning</jats:list-item> <jats:list-item>Technologies &gt; Prediction</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"148 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144747364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Guide to Machine Learning Epistemic Ignorance, Hidden Paradoxes, and Other Tensions 机器学习认知无知、隐藏的悖论和其他紧张关系指南
Pub Date : 2025-07-23 DOI: 10.1002/widm.70038
M. Z. Naser
Machine learning (ML) has rapidly scaled in capacity and complexity, yet blind spots persist beneath its high performance façade. In order to shed more light on this argument, this paper presents a curated catalogue of 175 unconventional concepts, each capturing a paradox, tension, or overlooked risk in modern ML practice. Through nine themes spanning data quality, model architecture and training, interpretability and explainability, fairness and bias, model behavior and limitations, evaluation and metrics, multimodal and system integration, practical and societal implications, and causal reasoning, we provide conceptual definitions, illustrative examples, and actionable mitigation strategies. This review equips practitioners and researchers with a structured taxonomy for diagnosing and preempting the brittle edges of modern ML systems and offers a paradox detection and remediation framework (PDRF) to anticipate limitations, design more thoughtful evaluation protocols, and develop ML systems that balance predictive power with epistemic transparency.This article is categorized under: Fundamental Concepts of Data and Knowledge > Data Concepts Fundamental Concepts of Data and Knowledge > Big Data Mining Technologies > Computational Intelligence
机器学习(ML)在容量和复杂性方面迅速扩大,但在其高性能表面下仍然存在盲点。为了更清楚地阐明这一论点,本文提出了175个非常规概念的策划目录,每个概念都抓住了现代机器学习实践中的悖论、紧张或被忽视的风险。通过九个主题,包括数据质量、模型架构和训练、可解释性和可解释性、公平性和偏见、模型行为和局限性、评估和度量、多模态和系统集成、实际和社会影响以及因果推理,我们提供了概念定义、说明性示例和可操作的缓解策略。这篇综述为从业者和研究人员提供了一个结构化的分类来诊断和预防现代机器学习系统的脆弱边缘,并提供了一个悖论检测和补救框架(PDRF)来预测局限性,设计更深思熟虑的评估协议,并开发平衡预测能力和认知透明度的机器学习系统。本文分类如下:数据和知识的基本概念>;数据与知识的基本概念大数据挖掘技术;计算智能
{"title":"A Guide to Machine Learning Epistemic Ignorance, Hidden Paradoxes, and Other Tensions","authors":"M. Z. Naser","doi":"10.1002/widm.70038","DOIUrl":"https://doi.org/10.1002/widm.70038","url":null,"abstract":"Machine learning (ML) has rapidly scaled in capacity and complexity, yet blind spots persist beneath its high performance façade. In order to shed more light on this argument, this paper presents a curated catalogue of 175 unconventional concepts, each capturing a paradox, tension, or overlooked risk in modern ML practice. Through nine themes spanning data quality, model architecture and training, interpretability and explainability, fairness and bias, model behavior and limitations, evaluation and metrics, multimodal and system integration, practical and societal implications, and causal reasoning, we provide conceptual definitions, illustrative examples, and actionable mitigation strategies. This review equips practitioners and researchers with a structured taxonomy for diagnosing and preempting the brittle edges of modern ML systems and offers a paradox detection and remediation framework (PDRF) to anticipate limitations, design more thoughtful evaluation protocols, and develop ML systems that balance predictive power with epistemic transparency.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Fundamental Concepts of Data and Knowledge &gt; Data Concepts</jats:list-item> <jats:list-item>Fundamental Concepts of Data and Knowledge &gt; Big Data Mining</jats:list-item> <jats:list-item>Technologies &gt; Computational Intelligence</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144693602","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical and Machine Learning Approaches for Electrical Energy Forecasting 电能预测的统计和机器学习方法
Pub Date : 2025-07-15 DOI: 10.1002/widm.70033
Solange Machado, Xingquan Zhu
With renewable energy being aggressively integrated into the grid, energy supplies are becoming vulnerable to weather and the environment, and are often incapable of meeting population demands at a large scale if not accurately predicted for energy planning. Understanding consumers' power demands ahead of time and the influences of weather on consumption and generation can help producers generate effective power management plans to support the target demand. In addition to the high correlation with the environment, consumers' behaviors also cause non‐stationary characteristics of energy data, which is the main challenge for energy prediction. In this survey, we perform a review of the literature on prediction methods in the energy field. So far, most of the available research encompasses one type of generation or consumption. There is no research approaching prediction in the energy sector as a whole and its correlated features. We propose to address the energy prediction challenges from both consumption and generation sides, encompassing techniques from statistical to machine learning techniques. We also summarize the work related to energy prediction, electricity measurements, challenges related to energy consumption and generation, energy forecasting methods, and real‐world energy forecasting resources, such as datasets and software solutions for energy prediction.This article is categorized under: Application Areas > Industry Specific Applications Technologies > Prediction Technologies > Machine Learning
随着可再生能源被积极地整合到电网中,能源供应变得越来越容易受到天气和环境的影响,如果不能准确地预测能源规划,往往无法满足大规模的人口需求。提前了解消费者的电力需求以及天气对消费和发电的影响,可以帮助生产商制定有效的电力管理计划,以支持目标需求。消费者行为除了与环境高度相关外,还会导致能源数据的非平稳特征,这是能源预测的主要挑战。在本调查中,我们对能源领域预测方法的文献进行了回顾。到目前为止,大多数可用的研究都只涉及一种类型的发电或消费。目前还没有针对能源行业整体预测及其相关特征的研究。我们建议从消费和发电两方面解决能源预测的挑战,包括从统计到机器学习技术的技术。我们还总结了与能源预测、电力测量、能源消耗和发电相关的挑战、能源预测方法和现实世界的能源预测资源(如能源预测的数据集和软件解决方案)相关的工作。本文分类如下:应用领域>;行业特定应用技术;预测技术;机器学习
{"title":"Statistical and Machine Learning Approaches for Electrical Energy Forecasting","authors":"Solange Machado, Xingquan Zhu","doi":"10.1002/widm.70033","DOIUrl":"https://doi.org/10.1002/widm.70033","url":null,"abstract":"With renewable energy being aggressively integrated into the grid, energy supplies are becoming vulnerable to weather and the environment, and are often incapable of meeting population demands at a large scale if not accurately predicted for energy planning. Understanding consumers' power demands ahead of time and the influences of weather on consumption and generation can help producers generate effective power management plans to support the target demand. In addition to the high correlation with the environment, consumers' behaviors also cause non‐stationary characteristics of energy data, which is the main challenge for energy prediction. In this survey, we perform a review of the literature on prediction methods in the energy field. So far, most of the available research encompasses one type of generation or consumption. There is no research approaching prediction in the energy sector as a whole and its correlated features. We propose to address the energy prediction challenges from both consumption and generation sides, encompassing techniques from statistical to machine learning techniques. We also summarize the work related to energy prediction, electricity measurements, challenges related to energy consumption and generation, energy forecasting methods, and real‐world energy forecasting resources, such as datasets and software solutions for energy prediction.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Application Areas &gt; Industry Specific Applications</jats:list-item> <jats:list-item>Technologies &gt; Prediction</jats:list-item> <jats:list-item>Technologies &gt; Machine Learning</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"671 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144629782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Systematic Literature Survey of Crowdsourcing: Current Status and Future Perspectives 众包的系统文献综述:现状与未来展望
Pub Date : 2025-07-14 DOI: 10.1002/widm.70037
Himanshu Suyal, Avtar Singh
Crowdsourcing has recently evolved as a distributed human problem‐solving method and has received considerable interest from academics and practitioners in various domains. The proliferation of crowdsourcing has made it much simpler to utilize the intelligence and adaptability of many people to learn new knowledge to solve the problem of acquiring new knowledge. In the past, numerous crowdsourcing works have highlighted multiple aspects; however, no surveys have been conducted that focus on the entire crowdsourcing process. This concentrated survey provides a comprehensive review of the technical advances from a systematic perspective. This survey systematically reviews technical advances for a crowdsourcing process that contains four dimensions: task modeling, crowdsourcing data acquisition, the learning process, and predictive model learning, and proposes a comprehensive and scalable framework from CROWD4AI (Crowdsourcing Framework with 4 Dimensions for Artificial Intelligence). In addition, this paper focuses on each dimension's potential challenges and future direction, encouraging researchers to participate in crowdsourcing. To bridge theory with practice, we also include a detailed case study that demonstrates the real‐world application of our proposed framework in the context of annotating cultural heritage damages using crowdsourced input. The case study illustrates how the framework supports effective task design, label collection, robust learning strategies, and accurate predictive modeling in a practical setting.This article is categorized under: Technologies > Crowdsourcing Technologies > Machine Learning
众包最近发展成为一种分布式的人类解决问题的方法,并受到了各个领域的学者和实践者的极大兴趣。众包的扩散使得利用许多人的智慧和适应性来学习新知识,解决获取新知识的问题变得更加简单。过去,众多众包作品都突出了多个方面;然而,目前还没有针对整个众包过程的调查。这个集中的调查从系统的角度对技术进步进行了全面的回顾。本调查系统地回顾了包含任务建模、众包数据获取、学习过程和预测模型学习四个维度的众包过程的技术进展,并提出了一个来自CROWD4AI(人工智能4维度众包框架)的全面且可扩展的框架。此外,本文还重点分析了各个维度的潜在挑战和未来方向,鼓励研究人员参与众包。为了将理论与实践联系起来,我们还包括了一个详细的案例研究,以展示我们提出的框架在使用众包输入来注释文化遗产损害的背景下的实际应用。案例研究说明了该框架如何在实际环境中支持有效的任务设计、标签收集、健壮的学习策略和准确的预测建模。本文分类如下:技术>;众包技术;机器学习
{"title":"A Systematic Literature Survey of Crowdsourcing: Current Status and Future Perspectives","authors":"Himanshu Suyal, Avtar Singh","doi":"10.1002/widm.70037","DOIUrl":"https://doi.org/10.1002/widm.70037","url":null,"abstract":"Crowdsourcing has recently evolved as a distributed human problem‐solving method and has received considerable interest from academics and practitioners in various domains. The proliferation of crowdsourcing has made it much simpler to utilize the intelligence and adaptability of many people to learn new knowledge to solve the problem of acquiring new knowledge. In the past, numerous crowdsourcing works have highlighted multiple aspects; however, no surveys have been conducted that focus on the entire crowdsourcing process. This concentrated survey provides a comprehensive review of the technical advances from a systematic perspective. This survey systematically reviews technical advances for a crowdsourcing process that contains four dimensions: task modeling, crowdsourcing data acquisition, the learning process, and predictive model learning, and proposes a comprehensive and scalable framework from CROWD4AI (Crowdsourcing Framework with 4 Dimensions for Artificial Intelligence). In addition, this paper focuses on each dimension's potential challenges and future direction, encouraging researchers to participate in crowdsourcing. To bridge theory with practice, we also include a detailed case study that demonstrates the real‐world application of our proposed framework in the context of annotating cultural heritage damages using crowdsourced input. The case study illustrates how the framework supports effective task design, label collection, robust learning strategies, and accurate predictive modeling in a practical setting.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Technologies &gt; Crowdsourcing</jats:list-item> <jats:list-item>Technologies &gt; Machine Learning</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"52 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144629784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning and Deep Learning Techniques to Detect Mental Stress Using Various Physiological Signals: A Critical Insight 利用各种生理信号检测精神压力的机器学习和深度学习技术:一个关键的见解
Pub Date : 2025-07-14 DOI: 10.1002/widm.70035
Megha Khandelwal, Arun Sharma
This paper presents a comprehensive review on the various techniques and methodologies employed to detect stress among individuals. The review encompasses a broad spectrum of methods, including physiological measurements, wearable technology, machine learning and deep learning algorithms, and contactless image‐based techniques. The paper outlines the physiological markers commonly associated with stress, such as Electrocardiogram (ECG), Electroencephalography (EEG), Photoplethysmography (PPG), and Skin Galvanic response. It examines the various wearable and contactless techniques to acquire data. Furthermore, it explores the integration of machine learning and deep learning techniques for the development of predictive stress detection models, highlighting their accuracy. It also addresses the potential of multispectral and hyperspectral imaging in this area. Some of the publicly available datasets are also discussed in this paper.This article is categorized under: Application Areas > Health Care Technologies > Machine Learning
本文提出了对各种技术和方法的全面审查,用于检测个人之间的压力。该综述涵盖了广泛的方法,包括生理测量、可穿戴技术、机器学习和深度学习算法,以及基于非接触式图像的技术。本文概述了通常与应激相关的生理指标,如心电图(ECG)、脑电图(EEG)、光容积脉搏波(PPG)和皮肤电反应。它检查了各种可穿戴和非接触式技术来获取数据。此外,它还探讨了机器学习和深度学习技术的集成,以开发预测应力检测模型,突出其准确性。它还讨论了多光谱和高光谱成像在该领域的潜力。本文还讨论了一些公开可用的数据集。本文分类如下:应用领域>;医疗保健技术;机器学习
{"title":"Machine Learning and Deep Learning Techniques to Detect Mental Stress Using Various Physiological Signals: A Critical Insight","authors":"Megha Khandelwal, Arun Sharma","doi":"10.1002/widm.70035","DOIUrl":"https://doi.org/10.1002/widm.70035","url":null,"abstract":"This paper presents a comprehensive review on the various techniques and methodologies employed to detect stress among individuals. The review encompasses a broad spectrum of methods, including physiological measurements, wearable technology, machine learning and deep learning algorithms, and contactless image‐based techniques. The paper outlines the physiological markers commonly associated with stress, such as Electrocardiogram (ECG), Electroencephalography (EEG), Photoplethysmography (PPG), and Skin Galvanic response. It examines the various wearable and contactless techniques to acquire data. Furthermore, it explores the integration of machine learning and deep learning techniques for the development of predictive stress detection models, highlighting their accuracy. It also addresses the potential of multispectral and hyperspectral imaging in this area. Some of the publicly available datasets are also discussed in this paper.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Application Areas &gt; Health Care</jats:list-item> <jats:list-item>Technologies &gt; Machine Learning</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144629783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Survey on Efficient Vision‐Language Models 高效视觉语言模型研究综述
Pub Date : 2025-07-14 DOI: 10.1002/widm.70036
Gaurav Shinde, Anuradha Ravi, Emon Dey, Shadman Sakib, Milind Rampure, Nirmalya Roy
Vision‐language models (VLMs) integrate visual and textual information, enabling a wide range of applications such as image captioning and visual question answering, making them crucial for modern AI systems. However, their high computational demands pose challenges for real‐time applications. This has led to a growing focus on developing efficient vision‐language models. In this survey, we review key techniques for optimizing VLMs on edge and resource‐constrained devices. We also explore compact VLM architectures, frameworks, and provide detailed insights into the performance–memory trade‐offs of efficient VLMs. Furthermore, we establish a GitHub repository at MPSC‐GitHub to compile all surveyed papers, which we will actively update. Our objective is to foster deeper research in this area.This article is categorized under: Fundamental Concepts of Data and Knowledge > Big Data Mining Technologies > Internet of Things Technologies > Artificial Intelligence
视觉语言模型(vlm)集成了视觉和文本信息,实现了广泛的应用,如图像字幕和视觉问答,使其成为现代人工智能系统的关键。然而,它们的高计算需求给实时应用带来了挑战。这使得人们越来越关注开发高效的视觉语言模型。在本调查中,我们回顾了在边缘和资源受限设备上优化vlm的关键技术。我们还探讨了紧凑的VLM架构、框架,并提供了高效VLM的性能内存权衡的详细见解。此外,我们在MPSC - GitHub上建立了一个GitHub存储库来编译所有被调查的论文,我们将积极更新。我们的目标是促进这一领域的深入研究。本文分类如下:数据和知识的基本概念>;大数据挖掘技术;物联网技术;人工智能
{"title":"A Survey on Efficient Vision‐Language Models","authors":"Gaurav Shinde, Anuradha Ravi, Emon Dey, Shadman Sakib, Milind Rampure, Nirmalya Roy","doi":"10.1002/widm.70036","DOIUrl":"https://doi.org/10.1002/widm.70036","url":null,"abstract":"Vision‐language models (VLMs) integrate visual and textual information, enabling a wide range of applications such as image captioning and visual question answering, making them crucial for modern AI systems. However, their high computational demands pose challenges for real‐time applications. This has led to a growing focus on developing efficient vision‐language models. In this survey, we review key techniques for optimizing VLMs on edge and resource‐constrained devices. We also explore compact VLM architectures, frameworks, and provide detailed insights into the performance–memory trade‐offs of efficient VLMs. Furthermore, we establish a GitHub repository at MPSC‐GitHub to compile all surveyed papers, which we will actively update. Our objective is to foster deeper research in this area.This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item>Fundamental Concepts of Data and Knowledge &gt; Big Data Mining</jats:list-item> <jats:list-item>Technologies &gt; Internet of Things</jats:list-item> <jats:list-item>Technologies &gt; Artificial Intelligence</jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"23 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144622234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
WIREs Data Mining and Knowledge Discovery
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1