首页 > 最新文献

WIREs Data Mining and Knowledge Discovery最新文献

英文 中文
Quantum Frontiers in the Battle for Information Integrity 信息完整性之战中的量子前沿
Pub Date : 2026-02-01 DOI: 10.1002/widm.70067
Vincenzo Loia, Stefania Tomasiello
In an era of rapid digital communication, the proliferation of manipulated information has emerged as a critical global challenge that undermines the integrity of information. Misinformation, often spread unintentionally, and disinformation, deliberately crafted to deceive, have far-reaching consequences, including eroding public trust, disrupting democratic processes, and endangering public health. Various forms, such as fake news, manipulated media, fake reviews, spam, and phishing, exploit social media and communication platforms to mislead users. Numerous techniques have been developed to detect false content, as discussed in several review articles devoted to the topic, but without mentioning quantum computing approaches. Notably, recent quantum computing reviews have not addressed misinformation or disinformation-related applications, despite growing interest in quantum methods across domains such as medicine, finance, and cybersecurity. This gap and the presence of relevant literature, especially over the last 2 years, highlight a pressing need for surveying research works into the intersection of quantum computing and misinformation or disinformation detection, which this work aims to address.
在快速数字通信的时代,被操纵的信息的扩散已成为破坏信息完整性的重大全球挑战。错误信息往往是无意中传播的,而虚假信息则是故意制造的,具有深远的影响,包括侵蚀公众信任、破坏民主进程和危害公众健康。虚假新闻、操纵媒体、虚假评论、垃圾邮件和网络钓鱼等各种形式利用社交媒体和通信平台误导用户。已经开发了许多技术来检测虚假内容,正如专门讨论该主题的几篇评论文章所讨论的那样,但没有提到量子计算方法。值得注意的是,尽管医学、金融和网络安全等领域对量子方法的兴趣日益浓厚,但最近的量子计算评论并没有解决与错误信息或虚假信息相关的应用。这一差距和相关文献的存在,特别是在过去的两年里,突出了对量子计算与错误信息或虚假信息检测交叉的调查研究工作的迫切需要,这是本工作旨在解决的问题。
{"title":"Quantum Frontiers in the Battle for Information Integrity","authors":"Vincenzo Loia, Stefania Tomasiello","doi":"10.1002/widm.70067","DOIUrl":"https://doi.org/10.1002/widm.70067","url":null,"abstract":"In an era of rapid digital communication, the proliferation of manipulated information has emerged as a critical global challenge that undermines the integrity of information. Misinformation, often spread unintentionally, and disinformation, deliberately crafted to deceive, have far-reaching consequences, including eroding public trust, disrupting democratic processes, and endangering public health. Various forms, such as fake news, manipulated media, fake reviews, spam, and phishing, exploit social media and communication platforms to mislead users. Numerous techniques have been developed to detect false content, as discussed in several review articles devoted to the topic, but without mentioning quantum computing approaches. Notably, recent quantum computing reviews have not addressed misinformation or disinformation-related applications, despite growing interest in quantum methods across domains such as medicine, finance, and cybersecurity. This gap and the presence of relevant literature, especially over the last 2 years, highlight a pressing need for surveying research works into the intersection of quantum computing and misinformation or disinformation detection, which this work aims to address.","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"58 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146110182","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fairness Definitions in Language Models Explained 解释语言模型中的公平性定义
Pub Date : 2026-01-14 DOI: 10.1002/widm.70063
Zhipeng Yin, Zichong Wang, Avash Palikhe, Wenbin Zhang
Language Models (LMs) have demonstrated exceptional performance across various Natural Language Processing (NLP) tasks. Despite these advancements, LMs can inherit and amplify societal biases related to sensitive attributes such as gender and race, limiting their adoption in real‐world applications. Therefore, fairness has been extensively explored in LMs, leading to the proposal of various fairness notions. However, the lack of clear agreement on which fairness definition to apply in specific contexts and the complexity of understanding the distinctions between these definitions can create confusion and impede further progress. To this end, this paper proposes a systematic survey that clarifies the definitions of fairness as they apply to LMs. Specifically, we begin with a brief introduction to LMs and fairness in LMs, followed by a comprehensive, up‐to‐date overview of existing fairness notions in LMs and the introduction of a novel taxonomy that categorizes these concepts based on their transformer architecture: encoder‐only, decoder‐only, and encoder‐decoder LMs. We further illustrate each definition through experiments, showcasing their practical implications and outcomes. Finally, we discuss current research challenges and open questions, aiming to foster innovative ideas and advance the field. The repository is publicly available online at https://github.com/vanbanTruong/Fairness‐in‐Large‐Language‐Models/tree/main/definitions . This article is categorized under: Commercial, Legal, and Ethical Issues > Fairness in Data Mining Commercial, Legal, and Ethical Issues > Social Considerations Technologies > Artificial Intelligence .
语言模型(LMs)在各种自然语言处理(NLP)任务中表现出优异的性能。尽管取得了这些进步,但LMs可能会继承和放大与性别和种族等敏感属性相关的社会偏见,从而限制了它们在现实世界中的应用。因此,在lm中对公平性进行了广泛的探讨,并提出了各种公平性概念。然而,在具体情况下应用哪一个公平定义以及理解这些定义之间的区别的复杂性方面缺乏明确的共识,可能会造成混乱并阻碍进一步的进展。为此,本文提出了一个系统的调查,澄清公平的定义,因为他们适用于LMs。具体来说,我们首先简要介绍了LMs和LMs中的公平性,然后对LMs中现有公平性概念进行了全面的、最新的概述,并介绍了一种新的分类法,该分类法根据它们的转换器架构对这些概念进行了分类:仅编码器、仅解码器和编码器解码器LMs。我们通过实验进一步说明每个定义,展示它们的实际含义和结果。最后,我们讨论了当前的研究挑战和开放问题,旨在培养创新思想和推进该领域。该存储库可在https://github.com/vanbanTruong/Fairness‐in‐Large‐Language‐Models/tree/main/definitions上公开获取。本文被分类为:商业、法律和伦理问题;数据挖掘中的公平性;商业、法律和伦理问题;社会考虑技术;人工智能。
{"title":"Fairness Definitions in Language Models Explained","authors":"Zhipeng Yin, Zichong Wang, Avash Palikhe, Wenbin Zhang","doi":"10.1002/widm.70063","DOIUrl":"https://doi.org/10.1002/widm.70063","url":null,"abstract":"Language Models (LMs) have demonstrated exceptional performance across various Natural Language Processing (NLP) tasks. Despite these advancements, LMs can inherit and amplify societal biases related to sensitive attributes such as gender and race, limiting their adoption in real‐world applications. Therefore, fairness has been extensively explored in LMs, leading to the proposal of various fairness notions. However, the lack of clear agreement on which fairness definition to apply in specific contexts and the complexity of understanding the distinctions between these definitions can create confusion and impede further progress. To this end, this paper proposes a systematic survey that clarifies the definitions of fairness as they apply to LMs. Specifically, we begin with a brief introduction to LMs and fairness in LMs, followed by a comprehensive, up‐to‐date overview of existing fairness notions in LMs and the introduction of a novel taxonomy that categorizes these concepts based on their transformer architecture: encoder‐only, decoder‐only, and encoder‐decoder LMs. We further illustrate each definition through experiments, showcasing their practical implications and outcomes. Finally, we discuss current research challenges and open questions, aiming to foster innovative ideas and advance the field. The repository is publicly available online at <jats:ext-link xmlns:xlink=\"http://www.w3.org/1999/xlink\" xlink:href=\"https://github.com/vanbanTruong/Fairness-in-Large-Language-Models/tree/main/definitions\">https://github.com/vanbanTruong/Fairness‐in‐Large‐Language‐Models/tree/main/definitions</jats:ext-link> . This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item> Commercial, Legal, and Ethical Issues &gt; Fairness in Data Mining </jats:list-item> <jats:list-item> Commercial, Legal, and Ethical Issues &gt; Social Considerations </jats:list-item> <jats:list-item> Technologies &gt; Artificial Intelligence </jats:list-item> </jats:list> .","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"51 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145961760","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial Intelligence for Road Anomaly Detection: A Review 道路异常检测的人工智能研究进展
Pub Date : 2026-01-06 DOI: 10.1002/widm.70054
Rohit Samanta, Amutha Sadasivan, Muthu Subash Kavitha, Surendiran Balasubramanian
Road safety is a critical issue due to its significant impact on public health and economic stability. Traffic accidents result in millions of fatalities and injuries globally each year, imposing substantial healthcare costs and loss of productivity. Therefore, systematic data collection is urgently needed to identify key road safety challenges and implement effective solutions. This study examines recent advancements in artificial intelligence (AI) and deep learning techniques for detecting road anomalies, including potholes and speed bumps, utilizing cost‐effective, commercially available cameras. It provides a comprehensive overview of various methodologies for detecting road damage, emphasizing the value of integrating visual, qualitative, and quantitative analyses. Additionally, the study evaluates various algorithms, including R‐CNN (Regions with CNN) for object detection and CrackU‐net for crack detection, to analyze their effectiveness in enhancing road maintenance and safety. Beyond technical methods, the study also examines global trends in road safety, emphasizing the need for comprehensive policy frameworks and knowledge transfer from developed to developing countries to reduce fatalities and enhance road infrastructure. Finally, the study addresses challenges such as limited visibility, adverse weather conditions, and the current limitations of existing models, while discussing the potential for future advancements in automated road safety systems. This article is categorized under: Technologies > Artificial Intelligence
道路安全是一个关键问题,因为它对公共卫生和经济稳定有重大影响。交通事故每年在全球造成数百万人死亡和受伤,造成大量医疗保健费用和生产力损失。因此,迫切需要系统地收集数据,以确定关键的道路安全挑战并实施有效的解决方案。本研究考察了人工智能(AI)和深度学习技术在检测道路异常(包括坑洼和减速带)方面的最新进展,这些技术利用具有成本效益的市售摄像头。它提供了检测道路损坏的各种方法的全面概述,强调整合视觉,定性和定量分析的价值。此外,该研究还评估了各种算法,包括用于物体检测的R - CNN(带CNN的区域)和用于裂缝检测的CrackU - net,以分析它们在增强道路维护和安全方面的有效性。除技术方法外,该研究还审查了道路安全方面的全球趋势,强调需要建立全面的政策框架,并从发达国家向发展中国家转移知识,以减少死亡人数和加强道路基础设施。最后,该研究解决了能见度有限、恶劣天气条件和现有模型当前的局限性等挑战,同时讨论了自动化道路安全系统未来发展的潜力。本文分类如下:技术&人工智能
{"title":"Artificial Intelligence for Road Anomaly Detection: A Review","authors":"Rohit Samanta, Amutha Sadasivan, Muthu Subash Kavitha, Surendiran Balasubramanian","doi":"10.1002/widm.70054","DOIUrl":"https://doi.org/10.1002/widm.70054","url":null,"abstract":"Road safety is a critical issue due to its significant impact on public health and economic stability. Traffic accidents result in millions of fatalities and injuries globally each year, imposing substantial healthcare costs and loss of productivity. Therefore, systematic data collection is urgently needed to identify key road safety challenges and implement effective solutions. This study examines recent advancements in artificial intelligence (AI) and deep learning techniques for detecting road anomalies, including potholes and speed bumps, utilizing cost‐effective, commercially available cameras. It provides a comprehensive overview of various methodologies for detecting road damage, emphasizing the value of integrating visual, qualitative, and quantitative analyses. Additionally, the study evaluates various algorithms, including R‐CNN (Regions with CNN) for object detection and CrackU‐net for crack detection, to analyze their effectiveness in enhancing road maintenance and safety. Beyond technical methods, the study also examines global trends in road safety, emphasizing the need for comprehensive policy frameworks and knowledge transfer from developed to developing countries to reduce fatalities and enhance road infrastructure. Finally, the study addresses challenges such as limited visibility, adverse weather conditions, and the current limitations of existing models, while discussing the potential for future advancements in automated road safety systems. This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item> Technologies &gt; Artificial Intelligence </jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145902466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Privacy‐Preserving Threat Intelligence Model for Secure Healthcare Data Sharing in the Cloud 用于云端安全医疗保健数据共享的隐私保护威胁情报模型
Pub Date : 2026-01-06 DOI: 10.1002/widm.70064
I. Sakthidevi, G. Fathima
In the contemporary healthcare landscape, secure and efficient data sharing is paramount, especially when utilizing cloud‐based platforms. The advent of cloud computing has revolutionized healthcare data sharing, offering unparalleled accessibility and scalability. However, the inherent risks associated with data breaches and privacy violations pose significant challenges, necessitating robust security measures. In such scenarios, the integration of threat intelligence with privacy‐preserving techniques becomes imperative to safeguard sensitive healthcare information. This research introduces a novel algorithm, FedGANet, alongside an integrated Privacy‐Preserving Threat Intelligence Model (FedGAN‐PPTIM), developed to strengthen secure healthcare data exchange within cloud and IoMT environments. FedGANet enhances traditional security paradigms by jointly leveraging Generative Adversarial Networks (GANs) to synthesize realistic threat scenarios and Federated Learning (FL) to enable decentralized model training without exposing sensitive patient data. The model further aligns with interoperability considerations, supporting seamless integration into diverse clinical ecosystems. The proposed FedGAN‐PPTIM framework is extensively compared with established privacy‐preserving and threat intelligence approaches across multiple evaluation metrics, including privacy leakage, threat detection rate, false positive rate, and communication overhead. The simulation analysis demonstrates that FedGANet outperforms existing methods, significantly reducing privacy leakage and communication overhead while maintaining high threat detection rates and low false positive rates. These results underscore the efficacy of FedGANet in addressing privacy and security challenges in healthcare data sharing. This article is categorized under: Technologies > Cloud Computing Technologies > Artificial Intelligence Commercial, Legal, and Ethical Issues > Security and Privacy
在当代医疗保健领域,安全和高效的数据共享至关重要,尤其是在使用基于云的平台时。云计算的出现彻底改变了医疗保健数据共享,提供了无与伦比的可访问性和可伸缩性。然而,与数据泄露和隐私侵犯相关的固有风险构成了重大挑战,需要强有力的安全措施。在这种情况下,威胁情报与隐私保护技术的集成对于保护敏感的医疗保健信息变得势在必行。本研究引入了一种新的算法FedGANet,以及一个集成的隐私保护威胁情报模型(FedGAN - PPTIM),旨在加强云和IoMT环境中的安全医疗数据交换。FedGANet通过联合利用生成对抗网络(gan)来综合现实威胁场景和联邦学习(FL)来增强传统的安全范式,从而在不暴露敏感患者数据的情况下实现分散的模型训练。该模型进一步与互操作性考虑相一致,支持与各种临床生态系统的无缝集成。提出的FedGAN - PPTIM框架在多个评估指标上与现有的隐私保护和威胁情报方法进行了广泛的比较,包括隐私泄漏、威胁检测率、误报率和通信开销。仿真分析表明,FedGANet优于现有方法,在保持高威胁检测率和低误报率的同时,显著减少了隐私泄漏和通信开销。这些结果强调了FedGANet在解决医疗数据共享中的隐私和安全挑战方面的有效性。本文分类如下:技术>;云计算技术>;人工智能商业、法律和伦理问题>;安全和隐私
{"title":"A Privacy‐Preserving Threat Intelligence Model for Secure Healthcare Data Sharing in the Cloud","authors":"I. Sakthidevi, G. Fathima","doi":"10.1002/widm.70064","DOIUrl":"https://doi.org/10.1002/widm.70064","url":null,"abstract":"In the contemporary healthcare landscape, secure and efficient data sharing is paramount, especially when utilizing cloud‐based platforms. The advent of cloud computing has revolutionized healthcare data sharing, offering unparalleled accessibility and scalability. However, the inherent risks associated with data breaches and privacy violations pose significant challenges, necessitating robust security measures. In such scenarios, the integration of threat intelligence with privacy‐preserving techniques becomes imperative to safeguard sensitive healthcare information. This research introduces a novel algorithm, FedGANet, alongside an integrated Privacy‐Preserving Threat Intelligence Model (FedGAN‐PPTIM), developed to strengthen secure healthcare data exchange within cloud and IoMT environments. FedGANet enhances traditional security paradigms by jointly leveraging Generative Adversarial Networks (GANs) to synthesize realistic threat scenarios and Federated Learning (FL) to enable decentralized model training without exposing sensitive patient data. The model further aligns with interoperability considerations, supporting seamless integration into diverse clinical ecosystems. The proposed FedGAN‐PPTIM framework is extensively compared with established privacy‐preserving and threat intelligence approaches across multiple evaluation metrics, including privacy leakage, threat detection rate, false positive rate, and communication overhead. The simulation analysis demonstrates that FedGANet outperforms existing methods, significantly reducing privacy leakage and communication overhead while maintaining high threat detection rates and low false positive rates. These results underscore the efficacy of FedGANet in addressing privacy and security challenges in healthcare data sharing. This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item> Technologies &gt; Cloud Computing </jats:list-item> <jats:list-item> Technologies &gt; Artificial Intelligence </jats:list-item> <jats:list-item> Commercial, Legal, and Ethical Issues &gt; Security and Privacy </jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"36 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145902467","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Review on the Consistency of AI ‐Generated Images for Interior Design Rendering 室内设计渲染中人工智能生成图像的一致性研究综述
Pub Date : 2026-01-05 DOI: 10.1002/widm.70056
Shuangyang Tan, Shasha Chen
With the advancement of generative artificial intelligence, AI‐generated image methods have experienced rapid development in interior design rendering. These methods enable the rapid generation of creative interior design renderings but accompany uncertainties in the generated images, which challenges the requirements of design renderings. Researchers have explored various approaches to enhance consistency in AI‐generated images. This review summarizes the methods and roles of generative artificial intelligence in interior design compared with traditional techniques and the relationships between the AI‐generated images and controlled parameters such as the workflow nodes, prompts, and models. Image consistency is a critical factor in the design generation process; their methods to control interior design renderings include prompts, image‐to‐image, ControlNet, IP‐Adapter, LoRA, SAM, and so forth. Much evidence reveals that ControlNet could control the positional relationship, IP‐Adapter could influence different styles, LoRA could excel in customized styles, and SAM could modify local regions. This article is categorized under: Technologies > Artificial Intelligence Commercial, Legal, and Ethical Issues > Fairness in Data Mining
随着生成式人工智能的发展,人工智能生成的图像方法在室内设计渲染中得到了快速发展。这些方法能够快速生成创造性的室内设计效果图,但也伴随着生成图像的不确定性,这对设计效果图的要求提出了挑战。研究人员已经探索了各种方法来增强人工智能生成图像的一致性。本文总结了与传统技术相比,生成式人工智能在室内设计中的方法和作用,以及人工智能生成的图像与控制参数(如工作流节点、提示和模型)之间的关系。图像一致性是设计生成过程中的关键因素;他们控制室内设计渲染的方法包括提示、图像到图像、ControlNet、IP适配器、LoRA、SAM等等。大量证据表明,ControlNet可以控制位置关系,IP‐Adapter可以影响不同的风格,LoRA可以在定制风格中表现出色,SAM可以修改局部区域。本文分类如下:技术;人工智能;商业、法律和伦理问题;数据挖掘中的公平性
{"title":"A Review on the Consistency of AI ‐Generated Images for Interior Design Rendering","authors":"Shuangyang Tan, Shasha Chen","doi":"10.1002/widm.70056","DOIUrl":"https://doi.org/10.1002/widm.70056","url":null,"abstract":"With the advancement of generative artificial intelligence, AI‐generated image methods have experienced rapid development in interior design rendering. These methods enable the rapid generation of creative interior design renderings but accompany uncertainties in the generated images, which challenges the requirements of design renderings. Researchers have explored various approaches to enhance consistency in AI‐generated images. This review summarizes the methods and roles of generative artificial intelligence in interior design compared with traditional techniques and the relationships between the AI‐generated images and controlled parameters such as the workflow nodes, prompts, and models. Image consistency is a critical factor in the design generation process; their methods to control interior design renderings include prompts, image‐to‐image, ControlNet, IP‐Adapter, LoRA, SAM, and so forth. Much evidence reveals that ControlNet could control the positional relationship, IP‐Adapter could influence different styles, LoRA could excel in customized styles, and SAM could modify local regions. This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item> Technologies &gt; Artificial Intelligence </jats:list-item> <jats:list-item> Commercial, Legal, and Ethical Issues &gt; Fairness in Data Mining </jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"41 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145897539","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Security Solutions for the Internet of Things Using Machine Learning and Deep Learning: Current Trends and Future Directions 使用机器学习和深度学习的物联网安全解决方案:当前趋势和未来方向
Pub Date : 2026-01-02 DOI: 10.1002/widm.70059
Himanshu Sharma, Prabhat Kumar, Kavita Sharma
The sudden increase in adoption of the Internet of Things (IoT) has revolutionized modern living but also brought unprecedented security challenges due to its distributed, heterogeneous, and resource‐constrained nature. This review paper offers a comprehensive examination of machine learning (ML) and deep learning (DL) approaches tailored for intrusion detection and threat mitigation in IoT ecosystems. It explores the landscape of anomaly detection and classification techniques while analyzing their suitability, limitations, and deployment feasibility across IoT layers. The study also investigates the significance of feature engineering, model selection, and system scalability. A novel addition to this review is the integration of emerging trends such as explainable AI (XAI), which enhances transparency and trust in black‐box ML/DL models, and federated learning (FL), a privacy‐preserving paradigm that allows decentralized model training without raw data sharing. The synergy between FL and Edge AI is discussed to highlight real‐time, low‐latency security analytics at the network's edge. Comparative tables, domain‐specific applications (e.g., smart homes, healthcare, and industrial IoT), and architectural illustrations support the discourse, providing readers with an up‐to‐date understanding of current capabilities and ongoing research challenges. This paper concludes with practical implications, research gaps, and future directions for building intelligent, secure, and explainable IoT security frameworks that respect user privacy and enable scalable deployment. This article is categorized under: Fundamental Concepts of Data and Knowledge > Explainable AI Technologies > Internet of Things Technologies > Machine Learning
物联网(IoT)的突然普及彻底改变了现代生活,但由于其分布式、异构和资源受限的性质,也带来了前所未有的安全挑战。这篇综述论文全面研究了为物联网生态系统中的入侵检测和威胁缓解量身定制的机器学习(ML)和深度学习(DL)方法。它探讨了异常检测和分类技术的前景,同时分析了它们在物联网层之间的适用性、局限性和部署可行性。研究还探讨了特征工程、模型选择和系统可扩展性的重要性。本综述的一个新颖补充是整合了新兴趋势,如可解释人工智能(XAI),它增强了黑盒ML/DL模型的透明度和信任,以及联邦学习(FL),这是一种保护隐私的范例,允许在没有原始数据共享的情况下进行分散的模型训练。讨论了FL和边缘AI之间的协同作用,以突出网络边缘的实时、低延迟安全分析。比较表,特定领域的应用(例如,智能家居,医疗保健和工业物联网)和建筑插图支持论述,为读者提供对当前能力和正在进行的研究挑战的最新理解。本文总结了构建智能、安全、可解释的物联网安全框架的实际意义、研究差距和未来方向,这些框架应尊重用户隐私并实现可扩展部署。本文分类如下:数据和知识的基本概念;可解释的人工智能技术;物联网技术;机器学习
{"title":"Security Solutions for the Internet of Things Using Machine Learning and Deep Learning: Current Trends and Future Directions","authors":"Himanshu Sharma, Prabhat Kumar, Kavita Sharma","doi":"10.1002/widm.70059","DOIUrl":"https://doi.org/10.1002/widm.70059","url":null,"abstract":"The sudden increase in adoption of the Internet of Things (IoT) has revolutionized modern living but also brought unprecedented security challenges due to its distributed, heterogeneous, and resource‐constrained nature. This review paper offers a comprehensive examination of machine learning (ML) and deep learning (DL) approaches tailored for intrusion detection and threat mitigation in IoT ecosystems. It explores the landscape of anomaly detection and classification techniques while analyzing their suitability, limitations, and deployment feasibility across IoT layers. The study also investigates the significance of feature engineering, model selection, and system scalability. A novel addition to this review is the integration of emerging trends such as explainable AI (XAI), which enhances transparency and trust in black‐box ML/DL models, and federated learning (FL), a privacy‐preserving paradigm that allows decentralized model training without raw data sharing. The synergy between FL and Edge AI is discussed to highlight real‐time, low‐latency security analytics at the network's edge. Comparative tables, domain‐specific applications (e.g., smart homes, healthcare, and industrial IoT), and architectural illustrations support the discourse, providing readers with an up‐to‐date understanding of current capabilities and ongoing research challenges. This paper concludes with practical implications, research gaps, and future directions for building intelligent, secure, and explainable IoT security frameworks that respect user privacy and enable scalable deployment. This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item> Fundamental Concepts of Data and Knowledge &gt; Explainable AI </jats:list-item> <jats:list-item> Technologies &gt; Internet of Things </jats:list-item> <jats:list-item> Technologies &gt; Machine Learning </jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"41 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2026-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145894662","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Counterfactual Explanations in Education: A Systematic Review 教育中的反事实解释:系统回顾
Pub Date : 2025-12-29 DOI: 10.1002/widm.70060
Pamela Buñay‐Guisñan, Juan A. Lara, Cristóbal Romero
Counterfactuals are a type of explanations based on hypothetical scenarios used in Explainable Artificial Intelligence (XAI), showing what changes in input variables could have led to different outcomes in predictive problems. In the field of education, counterfactuals enable educators to explore various hypothetical scenarios, facilitating informed decision‐making and the application of educational strategies for improving students' academic performance or reducing dropout rates, among others. Despite the gradual expansion of research on counterfactuals in education, systematic literature reviews on this topic remain scarce. The identification of the most relevant advancements in this field can provide a deep insight into the current state of research, highlighting the most effective areas and revealing opportunities for future studies. The objective of this research is to conduct a systematic literature review, using the PRISMA methodology, to analyze three aspects regarding the use of counterfactuals in education: the problems that counterfactuals help to address in education, the methods and/or algorithms used to generate them, and how the counterfactuals are presented in the educational context. As a result, we have identified a series of key challenges and opportunities for future research over the next few years, which constitute the main contribution of this paper. This article is categorized under: Application Areas > Education and Learning Algorithmic Development > Causality Discovery Fundamental Concepts of Data and Knowledge > Explainable AI
反事实是一种基于可解释人工智能(XAI)中使用的假设场景的解释,显示输入变量的哪些变化可能导致预测问题的不同结果。在教育领域,反事实使教育工作者能够探索各种假设情景,促进知情决策和教育策略的应用,以提高学生的学习成绩或降低辍学率等。尽管对教育中的反事实的研究逐渐扩大,但关于这一主题的系统文献综述仍然很少。识别该领域最相关的进展可以提供对当前研究状况的深入了解,突出最有效的领域并揭示未来研究的机会。本研究的目的是进行系统的文献综述,使用PRISMA方法,分析反事实在教育中使用的三个方面:反事实有助于解决教育中的问题,用于生成它们的方法和/或算法,以及如何在教育环境中呈现反事实。因此,我们确定了未来几年未来研究的一系列关键挑战和机遇,这构成了本文的主要贡献。本文分类如下:应用领域;教育和学习算法发展;因果关系发现;数据和知识的基本概念;可解释的人工智能
{"title":"Counterfactual Explanations in Education: A Systematic Review","authors":"Pamela Buñay‐Guisñan, Juan A. Lara, Cristóbal Romero","doi":"10.1002/widm.70060","DOIUrl":"https://doi.org/10.1002/widm.70060","url":null,"abstract":"Counterfactuals are a type of explanations based on hypothetical scenarios used in Explainable Artificial Intelligence (XAI), showing what changes in input variables could have led to different outcomes in predictive problems. In the field of education, counterfactuals enable educators to explore various hypothetical scenarios, facilitating informed decision‐making and the application of educational strategies for improving students' academic performance or reducing dropout rates, among others. Despite the gradual expansion of research on counterfactuals in education, systematic literature reviews on this topic remain scarce. The identification of the most relevant advancements in this field can provide a deep insight into the current state of research, highlighting the most effective areas and revealing opportunities for future studies. The objective of this research is to conduct a systematic literature review, using the PRISMA methodology, to analyze three aspects regarding the use of counterfactuals in education: the problems that counterfactuals help to address in education, the methods and/or algorithms used to generate them, and how the counterfactuals are presented in the educational context. As a result, we have identified a series of key challenges and opportunities for future research over the next few years, which constitute the main contribution of this paper. This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item> Application Areas &gt; Education and Learning </jats:list-item> <jats:list-item> Algorithmic Development &gt; Causality Discovery </jats:list-item> <jats:list-item> Fundamental Concepts of Data and Knowledge &gt; Explainable AI </jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"30 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145847529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Functional Classification of Spiking Signal Data Using Artificial Intelligence Techniques: A Systematic Review 利用人工智能技术对尖峰信号数据进行功能分类:系统综述
Pub Date : 2025-12-27 DOI: 10.1002/widm.70053
Danial Sharifrazi, Nouman Javed, Javad Hassannataj Joloudari, Roohallah Alizadehsani, Saadat Behzadi, Prasad N. Paradkar, Ru‐San Tan, U. Rajendra Acharya, Asim Bhatti
Human brain neuron activities are incredibly significant nowadays. Neuronal behavior is assessed by analyzing signal data such as extracellular recording, which can offer scientists valuable information about diseases and neuron activities. One of the difficulties researchers confront while evaluating these signals is the existence of large volumes of spike data. Spikes are significant components of signal data that can happen as a consequence of vital biomarkers or physical issues such as electrode movements. Hence, distinguishing types of spikes is essential. From this spot, the spike classification concept commences. Previously, researchers classified spikes manually. The manual classification was not precise enough, as it involved extensive analysis. Consequently, Artificial Intelligence (AI) was introduced into neuroscience to assist clinicians in classifying spikes correctly. Recognizing noises from spikes produced by neural activity causes the spike classification task to bear a significant demand. Classifying spikes accurately and quickly reveals the role of AI in the scope of spike classification. This review provides an in‐depth discussion of the importance and use of AI in spike classification. This work organizes materials in the spike classification field for future studies and fully describes how spikes are recognized. Therefore, the existing datasets are described first. The topic of spike classification is then separated into three major components: preprocessing, classification, and evaluation. Each of these sections introduces existing methods and determines their importance. Having been summarized and compared, more efficient algorithms are highlighted. The primary goal of this work is to provide a perspective on spike classification for future research, as well as a thorough grasp of the methodologies and issues involved. In this work, numerous studies were extracted from various databases. The PRISMA‐related research guidelines were then used to choose papers. Then, research studies based on spike classification using machine learning and deep learning approaches with effective preprocessing were selected. Although there are research papers on spike sorting using the keyword spike, the primary focus of this study is on spike classification. Finally, 47 papers were selected for in‐depth review. First, useful information on the datasets for these papers is supplied. In addition, preprocessing approaches, classification methods, and ultimate performance are investigated in each of these studies. The material is then summarized. Furthermore, the fundamental concerns regarding spike classification raised in the opening of this paper are thoroughly addressed throughout the review. Our reviewing outcomes illustrate that support vector machine and clustering‐based algorithms drastically influence machine learning methods in terms of high accuracy and many uses. Moreover, convolutional neural networks, spiky neural networks, and atten
如今,人类大脑神经元的活动非常重要。神经元行为是通过分析细胞外记录等信号数据来评估的,这可以为科学家提供有关疾病和神经元活动的宝贵信息。研究人员在评估这些信号时面临的困难之一是存在大量的尖峰数据。尖峰是信号数据的重要组成部分,可能是重要生物标志物或电极运动等物理问题的结果。因此,区分尖峰的类型是必要的。从这里开始,穗分类概念开始了。以前,研究人员手动对尖峰进行分类。手工分类不够精确,因为它涉及大量的分析。因此,人工智能(AI)被引入神经科学,以帮助临床医生正确分类尖峰。从神经活动产生的脉冲中识别噪声使得脉冲分类任务承担了很大的需求。准确快速地对尖峰进行分类,揭示了人工智能在尖峰分类范围内的作用。这篇综述对人工智能在尖峰分类中的重要性和应用进行了深入的讨论。这项工作为未来的研究组织了尖峰分类领域的材料,并充分描述了如何识别尖峰。因此,首先描述现有的数据集。然后将尖峰分类的主题分为三个主要部分:预处理,分类和评估。每一部分都介绍了现有的方法,并确定了它们的重要性。通过总结和比较,突出了更有效的算法。这项工作的主要目的是为未来的研究提供一个关于刺突分类的观点,以及对所涉及的方法和问题的全面掌握。在这项工作中,从不同的数据库中提取了大量的研究。然后使用PRISMA相关研究指南来选择论文。然后,选择基于脉冲分类的机器学习和深度学习方法进行有效预处理的研究。虽然已有使用关键词spike进行spike分类的研究论文,但本研究的重点是对spike分类进行研究。最后,我们选择了47篇论文进行深入的综述。首先,提供了这些论文数据集的有用信息。此外,这些研究还探讨了预处理方法、分类方法和最终性能。然后对材料进行总结。此外,在本文开头提出的关于刺突分类的基本问题在整个审查中得到了彻底解决。我们的回顾结果表明,支持向量机和基于聚类的算法在高精度和多用途方面极大地影响了机器学习方法。此外,卷积神经网络、尖刺神经网络和基于注意力的技术可以在深度学习方法中对具有相当功能的尖刺进行分类。各种预处理和分类技术已经在实际应用中用于对医疗机构患者提取的信号数据进行分类。我们的综述强调了用机器学习和深度学习模型对神经科学应用中的峰值进行分类的重要性。这可以为使用人工智能对现实世界的医疗数据进行分类提供宝贵的见解和实践解决方案。本文分类如下:技术;人工智能技术;机器学习算法开发;生物数据挖掘
{"title":"Functional Classification of Spiking Signal Data Using Artificial Intelligence Techniques: A Systematic Review","authors":"Danial Sharifrazi, Nouman Javed, Javad Hassannataj Joloudari, Roohallah Alizadehsani, Saadat Behzadi, Prasad N. Paradkar, Ru‐San Tan, U. Rajendra Acharya, Asim Bhatti","doi":"10.1002/widm.70053","DOIUrl":"https://doi.org/10.1002/widm.70053","url":null,"abstract":"Human brain neuron activities are incredibly significant nowadays. Neuronal behavior is assessed by analyzing signal data such as extracellular recording, which can offer scientists valuable information about diseases and neuron activities. One of the difficulties researchers confront while evaluating these signals is the existence of large volumes of spike data. Spikes are significant components of signal data that can happen as a consequence of vital biomarkers or physical issues such as electrode movements. Hence, distinguishing types of spikes is essential. From this spot, the spike classification concept commences. Previously, researchers classified spikes manually. The manual classification was not precise enough, as it involved extensive analysis. Consequently, Artificial Intelligence (AI) was introduced into neuroscience to assist clinicians in classifying spikes correctly. Recognizing noises from spikes produced by neural activity causes the spike classification task to bear a significant demand. Classifying spikes accurately and quickly reveals the role of AI in the scope of spike classification. This review provides an in‐depth discussion of the importance and use of AI in spike classification. This work organizes materials in the spike classification field for future studies and fully describes how spikes are recognized. Therefore, the existing datasets are described first. The topic of spike classification is then separated into three major components: preprocessing, classification, and evaluation. Each of these sections introduces existing methods and determines their importance. Having been summarized and compared, more efficient algorithms are highlighted. The primary goal of this work is to provide a perspective on spike classification for future research, as well as a thorough grasp of the methodologies and issues involved. In this work, numerous studies were extracted from various databases. The PRISMA‐related research guidelines were then used to choose papers. Then, research studies based on spike classification using machine learning and deep learning approaches with effective preprocessing were selected. Although there are research papers on spike sorting using the keyword spike, the primary focus of this study is on spike classification. Finally, 47 papers were selected for in‐depth review. First, useful information on the datasets for these papers is supplied. In addition, preprocessing approaches, classification methods, and ultimate performance are investigated in each of these studies. The material is then summarized. Furthermore, the fundamental concerns regarding spike classification raised in the opening of this paper are thoroughly addressed throughout the review. Our reviewing outcomes illustrate that support vector machine and clustering‐based algorithms drastically influence machine learning methods in terms of high accuracy and many uses. Moreover, convolutional neural networks, spiky neural networks, and atten","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145844735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Systematic Review of Movement Tracking for Real‐Time Monitoring of Physical Exercises in the Gym 运动跟踪在健身房体育锻炼实时监测中的系统综述
Pub Date : 2025-12-19 DOI: 10.1002/widm.70057
Sahil Bhadane, Samrat Ganguly, Musaddik Karanje, Dhanush Rachaveti, S. Amutha, B. Surendiran
In recent years, the amalgamation of computer vision and deep learning technologies has led to the advancement of fitness and health‐related movement tracking in gyms. Such advancements have resulted in exercise‐related analyses within the gym environment. These analyses were made possible by collecting real‐time movement data from people working in the gym, such as kinematics, kinetics, EMG, and so forth. Further, real‐time feedback was provided using movement data to avoid injuries while working in the gym. The newly emerging field of movement tracking in the gym uses technologies that could improve workout accuracy and optimization in the fitness routine. Further, a broad spectrum of recent research assesses computer vision techniques and deep learning models to evaluate physical performance and create real‐time corrective feedback and monitoring systems. The review addresses innovative noncontact and contact‐based monitoring systems that could capture movement patterns and their specific datasets. Furthermore, the article highlights the challenges in real‐world gym settings, such as lighting variations, occlusion by gym equipment or people, and the high computational requirements of real‐time processing. The article also elaborates on different methods and models used for movement tracking in the gym and their advantages and disadvantages. Hence, such a review emphasizes the emergence of transformative computer vision and deep learning technology to revolutionize the fitness domain. This article is categorized under: Application Areas > Health Care Technologies > Artificial Intelligence Fundamental Concepts of Data and Knowledge > Human Centricity and User Interaction
近年来,计算机视觉和深度学习技术的融合推动了健身和健康相关运动跟踪在健身房的发展。这些进步已经在健身房环境中产生了与运动相关的分析。通过收集在健身房工作的人的实时运动数据,如运动学、动力学、肌电图等,这些分析成为可能。此外,使用运动数据提供实时反馈,以避免在健身房工作时受伤。健身房运动跟踪的新兴领域使用的技术可以提高锻炼准确性和优化健身程序。此外,最近的广泛研究评估了计算机视觉技术和深度学习模型,以评估物理性能并创建实时纠正反馈和监控系统。这篇综述讨论了创新的非接触和基于接触的监测系统,这些系统可以捕获运动模式及其特定数据集。此外,本文还强调了现实世界中健身房设置的挑战,例如照明变化,健身房设备或人员遮挡,以及实时处理的高计算要求。文章还详细阐述了健身房运动跟踪的不同方法和模型及其优缺点。因此,这样的回顾强调了变革性计算机视觉和深度学习技术的出现,以彻底改变健身领域。本文分类如下:应用领域>;医疗保健技术>;人工智能数据和知识的基本概念>;以人为中心和用户交互
{"title":"A Systematic Review of Movement Tracking for Real‐Time Monitoring of Physical Exercises in the Gym","authors":"Sahil Bhadane, Samrat Ganguly, Musaddik Karanje, Dhanush Rachaveti, S. Amutha, B. Surendiran","doi":"10.1002/widm.70057","DOIUrl":"https://doi.org/10.1002/widm.70057","url":null,"abstract":"In recent years, the amalgamation of computer vision and deep learning technologies has led to the advancement of fitness and health‐related movement tracking in gyms. Such advancements have resulted in exercise‐related analyses within the gym environment. These analyses were made possible by collecting real‐time movement data from people working in the gym, such as kinematics, kinetics, EMG, and so forth. Further, real‐time feedback was provided using movement data to avoid injuries while working in the gym. The newly emerging field of movement tracking in the gym uses technologies that could improve workout accuracy and optimization in the fitness routine. Further, a broad spectrum of recent research assesses computer vision techniques and deep learning models to evaluate physical performance and create real‐time corrective feedback and monitoring systems. The review addresses innovative noncontact and contact‐based monitoring systems that could capture movement patterns and their specific datasets. Furthermore, the article highlights the challenges in real‐world gym settings, such as lighting variations, occlusion by gym equipment or people, and the high computational requirements of real‐time processing. The article also elaborates on different methods and models used for movement tracking in the gym and their advantages and disadvantages. Hence, such a review emphasizes the emergence of transformative computer vision and deep learning technology to revolutionize the fitness domain. This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item> Application Areas &gt; Health Care </jats:list-item> <jats:list-item> Technologies &gt; Artificial Intelligence </jats:list-item> <jats:list-item> Fundamental Concepts of Data and Knowledge &gt; Human Centricity and User Interaction </jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145784990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Traditional to Foundation Models: A Survey for Land Use and Land Cover Hyperspectral Image Classification 从传统模型到基础模型:土地利用和土地覆盖高光谱图像分类综述
Pub Date : 2025-12-16 DOI: 10.1002/widm.70049
Radhesyam Vaddi, Boggavarapu Phaneendra Kumar Lakshmi Narasimha, Soma Mitra, Sushmita Mitra, Lorenzo Bruzzone, Swalpa Kumar Roy
Hyperspectral remote sensing image classification is one of the key research areas of the remote sensing community. The high dimensionality, complex structure of data, and availability of fewer training samples hinder classification performance. Traditional machine learning approaches focus mainly on feature extraction for hyperspectral image classification. The complex relationships among pixels, nonlinearity, and material complexity could not be established with these approaches. This results in a suboptimal solution for fewer training samples in hyperspectral images. Recent advances in deep architectures provide means to improve performance and analyze complex patterns effectively, which were challenging with traditional approaches. The present research systematically describes deep learning models, from basic convolutional neural networks to transfer learning, ensemble learning, attention networks and graph nets. Also, advanced transformer approaches such as Mamba architectures, foundation models and vision‐language models for hyperspectral images with a specific emphasis on land use and land cover mapping. These advanced approaches provide efficient classification and real‐time processing capabilities that allow solutions to other different real‐world applications like agriculture, urban mapping, forestry, and the environment. This research also compares key state‐of‐the‐art methodologies, highlights research challenges, and offers future directions for efficient and accurate classification. This review endorses assimilating multisource data, developing lightweight models for resource‐constrained environments, and progressing explainable deep learning frameworks to improve classification performance. This research also serves as a useful reference for researchers in the hyperspectral remote sensing community, supporting the determination of the most appropriate classification technique specific to a particular remote sensing application. This article is categorized under: Technologies > Classification Technologies > Machine Learning Technologies > Artificial Intelligence
高光谱遥感影像分类是遥感界的重点研究领域之一。数据的高维数、复杂的结构和较少的训练样本阻碍了分类性能。传统的机器学习方法主要集中在高光谱图像分类的特征提取上。这些方法无法建立像素、非线性和材料复杂性之间的复杂关系。这将导致高光谱图像中训练样本较少的次优解决方案。深度体系结构的最新进展提供了提高性能和有效分析复杂模式的方法,这是传统方法所面临的挑战。本研究系统地描述了深度学习模型,从基本卷积神经网络到迁移学习、集成学习、注意网络和图网络。此外,先进的变压器方法,如曼巴架构、基础模型和高光谱图像的视觉语言模型,特别强调土地利用和土地覆盖制图。这些先进的方法提供了高效的分类和实时处理能力,使解决方案适用于其他不同的现实世界应用,如农业、城市测绘、林业和环境。本研究还比较了最先进的关键方法,突出了研究挑战,并为有效和准确的分类提供了未来的方向。这篇综述支持吸收多源数据,为资源受限环境开发轻量级模型,并推进可解释的深度学习框架以提高分类性能。该研究也为高光谱遥感领域的研究人员提供了有益的参考,支持确定针对特定遥感应用的最合适的分类技术。本文分类如下:技术>;分类技术>;机器学习技术>;人工智能
{"title":"From Traditional to Foundation Models: A Survey for Land Use and Land Cover Hyperspectral Image Classification","authors":"Radhesyam Vaddi, Boggavarapu Phaneendra Kumar Lakshmi Narasimha, Soma Mitra, Sushmita Mitra, Lorenzo Bruzzone, Swalpa Kumar Roy","doi":"10.1002/widm.70049","DOIUrl":"https://doi.org/10.1002/widm.70049","url":null,"abstract":"Hyperspectral remote sensing image classification is one of the key research areas of the remote sensing community. The high dimensionality, complex structure of data, and availability of fewer training samples hinder classification performance. Traditional machine learning approaches focus mainly on feature extraction for hyperspectral image classification. The complex relationships among pixels, nonlinearity, and material complexity could not be established with these approaches. This results in a suboptimal solution for fewer training samples in hyperspectral images. Recent advances in deep architectures provide means to improve performance and analyze complex patterns effectively, which were challenging with traditional approaches. The present research systematically describes deep learning models, from basic convolutional neural networks to transfer learning, ensemble learning, attention networks and graph nets. Also, advanced transformer approaches such as Mamba architectures, foundation models and vision‐language models for hyperspectral images with a specific emphasis on land use and land cover mapping. These advanced approaches provide efficient classification and real‐time processing capabilities that allow solutions to other different real‐world applications like agriculture, urban mapping, forestry, and the environment. This research also compares key state‐of‐the‐art methodologies, highlights research challenges, and offers future directions for efficient and accurate classification. This review endorses assimilating multisource data, developing lightweight models for resource‐constrained environments, and progressing explainable deep learning frameworks to improve classification performance. This research also serves as a useful reference for researchers in the hyperspectral remote sensing community, supporting the determination of the most appropriate classification technique specific to a particular remote sensing application. This article is categorized under: <jats:list list-type=\"simple\"> <jats:list-item> Technologies &gt; Classification </jats:list-item> <jats:list-item> Technologies &gt; Machine Learning </jats:list-item> <jats:list-item> Technologies &gt; Artificial Intelligence </jats:list-item> </jats:list>","PeriodicalId":501013,"journal":{"name":"WIREs Data Mining and Knowledge Discovery","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145759509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
WIREs Data Mining and Knowledge Discovery
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1